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Purpose of This Manual 

The purpose of this manual is to enhance the use of formative assess- 
ment practices in the classroom. These formative practices are ones 
that support students while they are learning. The practices also 
ensure that students receive feedback and guidance on their perfor- 
mances in a timely manner so they can make improvements to those 
performances before being evaluated. In the pursuit of enhancing 
formative assessment practices, this publication is designed to help 
teachers do the following: 

1) Clarify science learning targets for their students (CHAPTER TWO). 

2) Understand the range of assessment methods available to 
teachers and explore the alignment of these methods to 
learning targets (CHAPTERS THREE and FOUR). 

3) Analyze assessment information to determine individual 
student learning needs (CHAPTER FIVE). 

4) Modify instructional and/or assessment practices to enhance 
student learning (CHAPTER FIVE). 

5) Reflect on current grading practices and determine if current 
methods accurately portray student achievement (APPENDIX). 

This publication is not intended simply as an informative text but also 
as a resource that encourages self-reflection on current practices. 

We hope you will interact with it, respond to the questions posed, 
implement some of the suggested strategies, and then reflect on 
such implementation. Through these activities, this manual can serve 
as an opportunity for you to examine your science assessment prac- 
tices. It is most helpful if two or more teachers work through the man- 
ual together. The synergy that results from professionals sharing ideas 
will greatly enhance the learning experience. 

Introduction 

Before we can concentrate on "how to assess," we must first define 
assessment What does it mean to "assess"? What is the purpose of 
assessment? Are there different methods of assessment? With a clear 
definition of assessment and a definite purpose given for this activity, 
we can then begin to develop assessments appropriate for science 
classes. 

Assessment simply means a process, strategy, or tool for collecting 
information about the learning process of individuals or groups. Such 
assessment has two broad goals: 1) to provide feedback to students 
about their own learning, and 2) to monitor the effectiveness of the 
instruction. A variety of assessments may be used to monitor the 
effectiveness of instruction, ranging from the large-scale achievement 
tests used to calculate Adequate Yearly Progress under the No Child 
Left Behind Act down to the observation of students' facial expres- 
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sions by a lecturing teacher. Students can also receive feedback in 
diverse ways (e.g., comments on a lab report or scores on a high- 
stakes test). 

In this publication, however, we will concentrate on assessments 
that occur on a day-to-day basis within the science classroom. 

Such classroom assessments constitute an ongoing process in which 
teachers and students interact to promote enhanced student learn- 
ing. This process involves using a variety of assessment strategies and 
tools to collect information about student learning and then using 
the data collected to diagnose learning problems, monitor student 
progress, or provide meaningful and timely feedback to students. 
Therefore, the main purpose of this publication is to promote an 
assessment process that enhances student achievement in science. 

This purpose of using classroom assessment to improve student per- 
formance is echoed throughout the current classroom assessment 
research literature. Some (Black, Harrison, Lee, Marshall, & Wiliam, 
2004; Black & Wiliam, 1998) label such assessments as formative 
in nature and emphasize that assessment is formative "when the 
evidence is actually used to adapt the teaching to meet student 
needs" (Black & Wiliam, 1998, p. 140). Chappuis, Stiggins, Arter, & 
Chappuis (2004, p. 35), echoing a term coined by Black and Wiliam, 
characterize this enhancing of student performance assessment pro- 
cess as "assessment for learning" and state that when "they assess 
for learning, teachers use classroom assessment and the continuous 
flow of information about student achievement that it provides to 
advance, not merely check on, student learning." Such assessment 
for learning is assessment that helps students identify the strengths 
and weaknesses of their performance so that they can improve their 
achievement. It is differentiated from "assessment of learning," which 
simply provides a means of rating students, or comparing them to 
one another. Assessment of learning, unlike assessment for learning, 
does not focus on feedback for improvement. 

What is the Classroom Assessment Cycle? 

In order to fulfill the goals listed above and to promote assessment for 
learning, we must first explain the Classroom Assessment Cycle. This 
cycle outlines an assessment process that focuses on improving stu- 
dent performance. The four main steps of the cycle are: 

1) Clarifying learning targets 

2) Gathering evidence in a variety of ways 

3) Analyzing assessment data 

4) Modifying instruction 

Let us examine a common assessment scenario and analyze it to see 
how it conforms to each step in the Classroom Assessment Cycle. 

SCENARIO ONE: Mr. Wei, a sixth-grade teacher, teaches a unit 
on soii formation and then gives a unit test with muitipie- 
choice, true/faise, and matching items to assess students’ 
retention of the information. Students are toid about the 
test one week in advance and are instructed not to bring 
resource materiais with them to the test. Students’ tests are 
scored and returned and form the basis of the six weeks’ 
grade. The teacher moves on to a new unit. 
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CLARIFYING LEARNING TARGETS: There is no mention in the above sce- 
nario of iearning targets. What is important for the students to know 
about soii formation? Are the important concepts drawn from the 
sixth-grade curricuium? Does the teacher expect the students to sim- 
piy iearn factuai information, or are other concepts important? For 
instance, did they iearn skiiis in operating scientific equipment, criticai 
thinking skiiis, skiiis in creating such products as iab reports, papers, 
modeis, etc.? For the scenario to exempiify the Ciassroom Assessment 
Cycie, the teacher wouid need to set particuiar iearning targets for 
the instruction, pian the assessments and the instruction based upon 
these targets, and most importantiy, share these learning targets with 
students prior to the actuai instruction. 

GATHERING EVIDENCE IN A VARIETY OF WAYS: The scenario mentions oniy 
one assessment— the unit test, it does not mention any formative 
assessments that may have prepared students to take this test. For 
exampie, did the teacher check comprehension of concepts before 
scheduiing the test? Fiow was this done? What day-to-day prac- 
tices heiped assure the teacher that students were ready for the unit 
test? Since this test is the oniy assessment mentioned and because 
it primariiy uses oniy one form of assessment (that of the seiected 
response), this scenario does not support the idea of gathering evi- 
dence in a variety of ways. 

Selected responses are those in which students choose their answers 
from a given iist, as with muitipie-choice, true/faise, or matching ques- 
tions. Mr. Wei's test did not incorporate opportunities for students to 
construct their own answers. Student-created essays, short-answers, 
graphs, or iiiustrations wouid aii be exampies of constructed responses 
that couid provide a window into student thinking and reveai student 
understanding of the concepts being tested. 



ANALYZING ASSESSMENT DATA: The teacher apparently did not collect 
any data to determine that students were prepared for the test, 
and he also failed to provide opportunities for students to receive 
feedback on their performances before being tested. The testing 
data itself does not appear to be analyzed, as the teacher immedi- 
ately moves on to another unit. To comply with this quadrant of the 
Classroom Assessment Cycle, the teacher would need to use the 
assessment data to make inferences about student learning, focus- 
ing not only on the whole class, but on individual student learning 
needs. The assessment data would need to be collected prior to 
instruction, during instruction, and after instruction. Mr. Wei appears 
to only collect data otter instruction, and even this data he does not 
appear to analyze. 

MODIFYING INSTRUCTION: Since the assessment data from the ending 
exam were not analyzed, no instructional modifications were mode. 
From the scenario, it appears that the teacher will simply move on to 
o new unit, regardless of the assessment results. 

If Scenario One does not comply with the precepts of the Classroom 
Assessment Cycle, what might such compliance look like? Scenario 
Two provides a classroom exemplar of this cycle in action. 

SCENARIO TWO: Mr. Rohib, a high-school chemistry teacher 
in Florida, is planning a unit on the Periodic Table of 
Elements. Mr. Rohib wonts his students to understand the 
organization of this table and be able to use the table to 
predict properties of particular elements. Before planning 
the unit, he consults the state standards and chooses two 
that are appropriate for the unit, including “The students 
will know that elements are arranged into groups and fam- 
ilies based on similarities in electron structure and that their 
physical and chemical properties can be predicted” and 
“The students will know that investigations are conducted 
to explore new phenomena, to check on previous results, 
to test how well a theory predicts, and to compare different 
theories.” Mr. Rahib decides that the culminating assess- 
ment will be a performance-based one in which students 
determine the physical and chemical properties of partic- 
ular elements and then predict the physical and chemical 
properties of untested elements. To prepare students for 
this assessment, Mr. Rahib decides to: a) assign textbook 
readings, b) hold class discussions of these readings, c) 
give a lecture on the atomic theory, d) provide students 
with a set of cards showing physical characteristics of per- 
sons and then help them organize the cards into “families” 
and predict how missing family members may look, e) ask 
students to provide explanations of the atomic theory and 
of the organization of the periodic table in their science 
journals, and f) assign a laboratory investigation of physi- 
cal and chemical properties of particular elements, with 
a concomitant laboratory report. He then designs rubrics 
(scoring guides) to help him assess the class discussions, 
science journals, and lab reports, as well as one for the 
culminating assessment. He shares the learning targets 
with students, using language they understand, before 
beginning the unit. As students progress through the unit, 
they are provided with written feedback via the rubrics as 



well as verbal feedback from Mr. Rahib. Mr. Rahib uses 
these formative assessments (a-f, above) to align his 
Instruction with student learning needs and to determine 
when students are adequately prepared for the culminat- 
ing assessment. He uses both the formative and the culmi- 
nating (summative) assessment results to determine which 
students hove difficulties in making predictions and selects 
instructional activities that will aid these students. These 
may include one-on-one instruction, peer tutoring, use of 
science manipulatives, etc. 

In comparing the two scenarios, we see that the students in Mr. 
Rahib's ciass are provided with the iearning targets prior to instruc- 
tion, they have muitipie opportunities to receive feedback on their 
iearning, and the assessments utiiized are varied and diverse. Mr. 
Rahib constantiy monitors student iearning via scoring rubrics, ana- 
iyzes the assessment data to determine if further instruction is neces- 
sary, and then modifies instruction accordingiy, both for the entire 
ciass and for individuai students, in Mr. Rahib's ciass, it is aiso evident 
that students are expected to do more than simpiy iearn facts. This 
did not appear to be the case in Mr. Wei's ciass. 

Traditionaiiy, the goai of most subject area courses (iike Mr. Wei's sci- 
ence ciass) has been for students to be abie to recognize or recaii 
important facts, concepts, or reiationships that have been expiained 
to them, in the past, we have aimost exciusiveiy vaiued students' suc- 
cess at retaining and bringing forth a sampie of the information they 
have retained. When a teacher emphasizes factuai knowiedge on 
a test, students conciude that remembering facts is the goai. When 
students are not given an opportunity to retest or improve their work, 
they may conciude that improvement is not vaiued. if higher-order 
thinking, probiem-soiving, and criticai thinking are vaiued, then 
ciassroom assessments shouid iend vaiue to them, it is important to 
remember that how and what we test sends a ciear message to stu- 
dents about what is vaiued. 

A key point to remember as you go through this manuai is that assess- 
ing students invoives gathering information about what they know 
or can do. if throughout 12 years of schooi students are assessed 
oniy on passive, non-creative work (worksheets, muitipie-choice 
tests), how iikeiy is it that they wiii become probiem-soivers, creative 
producers, effective communicators, and seif-directed iearners? in 
Chapter Two, we wiii expiore a variety of iearning targets for students 
that wiii enabie them to reach their fuii potentiais. 



APPLICATION 



You may wish to reflect upon your own current 
assessment practices to determine if you ore 
using oii the quadrants in the Ciassroom 
Assessment Cycie with your science ciosses. 
Figure 1 .2 may be of help in this process. 

V 








FIGURE 1.2 

Reflecting on Your Assessment Practices 

Directions: Place a check in the column that best matches your 
current practices. 



STATEMENT 


OFTEN 


SOMETIMES 


RARELY 


NEVER 


1. 1 use the state curriculum to help identify 
student learning targets. 










2. 1 share the learning targets with students 
before beginning instruction. 










3. 1 emphasize many types of learning targets 
in my teaching, not relying on factual 
learning targets alone. 










4. 1 plan my assessments before planning my 
instruction. 










5. 1 incorporate multiple formative 

assessments within each unit, so students 
have many opportunities to receive 
feedback on their learning. 










6. 1 create rubrics (scoring guides) to help 
students understand their academic 
weaknesses and strengths. 










7. 1 examine assessment data and make 
inferences about whole class and 
individual student learning. 










8. 1 use formative assessment data to revise/ 
retool my teaching within the unit. 










9. 1 use formative assessment data to 
individualize instruction for struggling 
students. 











To aid in identifying areas on which you may wish to concentrate, 
consider the foiiowing: 

STATEMENTS 1-3 reiate to the "Ciarify Learning Targets" quadrant of the 
Ciassroom Assessment Cycie. 

STATEMENTS 4-6 reiate to the "Gather Evidence in a Variety of Ways" 
quadrant of the Ciassroom Assessment Cycie. 

STATEMENTS 7 reiates to the "Anaiyze Assessment Data" quadrant of 
the Ciassroom Assessment Cycie. 

STATEMENTS 8-9 reiate to the "Modify instruction" quadrant of the 
Ciassroom Assessment Cycie. 




Clarifying i-earning 
Targets 



What Do We Want Students to Know 
and Be Able to Do? 



Before we can assess and teach our students, we must first decide 
what we want them to know and be abie to do. We usuaiiy start with 
broad goais and then break those down into smaiier components. 
For exampie. The National Science Education Standards pubiica- 
tion (1996, p. ix), written by the Nationai Research Council begins 
with this goai statement: "This nation has estabiished as a goai that 
aii students shouid achieve scientific 
iiteracy." The bookiet goes on to 
describe iiteracy: "Scientific iit- 
eracy enabies peopie to use 
scientific principies and pro- 
cesses in making personai 
decisions and to participate 
in discussions of scientific 
issues that affect society" 

(1996, p. ix). 



Clarify 
Learning 
Targets 



With this description, the Nationai Research Councii begins to break 
its overaii goai (scientific iiteracy) into smaiier component parts, as 
depicted in FIGURE 2.1. One component emphasizes scientific knowi- 
edge of concepts/principies, whiie the other emphasizes knowiedge 
of scientific processes. 



'Scientific MTERAcy 






/ 



X 



J 



Knowledge of 
scientific principles 



Ability to use 
scientific processes 



FIGURE 2.1 



To aid students in acquiring knowiedge of scientific principies and 
processes, the Nationai Research Councii has written content stan- 
dards within eight different categories: 

• Unifying concepts and processes in science 

• Science as inquiry 

• Physicai science 
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• Life science 

• Earth and space science 

• Science and technoiogy 

• Science in personai and sociai perspectives 

• History and nature of science (1996, p. 104) 

Within each of these eight categories, we find the actuai content 
standards. Each science content standard is then further expiicated 
by a "Guide to the Content Standard" section. FIGURE 2.2 dispiays this 
organizationai scheme graphicaiiy using exampies from the K-4 ievei. 



CATEGOi^y: 'Science as iNO-UiRy 



Content Standard A: 

As a result of activities in grades K-4, all students should develop: 

• Abilities necessary to do scientific inquiry 

• Understanding about scientific inquiry (1996, p. 121) 



The Guide to the Content Standard, "abiiities necessary to do 
scientific inquiry," inciudes the foiiowing tasks that students wiii be 
expected to accompiish: 

• Ask a question about objects, organisms, and events in the 
environment 

• Pian and conduct a simpie investigation 

• Empioy simpie equipment and toois to gather data and 
extend the senses 

• Use data to construct a reasonabie expianation 

• Communicate investigations and expianations (1 996, p. 1 22) 



in this manner, the Nationai Science Standards are broken down from 
compiex, generaiized statements to more specific ones. The tasks 
iisted in FIGURE 2.2 under the "Guide to the Content Standard" cat- 
egory show the most specific ievei inciuded in the Nationai Science 
Standards. The impiication is that if students can compiete aii these 
tasks successfuiiy, then they have the "abiiities necessary to do scien- 
tific inquiry." 

Like the Nationai Science Education Standards, your own state curric- 
uium may begin with broad goais that are then dissected into increas- 
ingiy smaiier component parts. For exampie, in Fiorida, the state sci- 
ence curricuium begins with the subject area (science), then breaks 
this into strands (The Nature of Matter, Energy, Force and Motion, 
Processes That Shape the Earth, Earth and Space, Processes of Life, 
and How Living Things interact With Their Environment). Under each 
of these strands, a number of standards further expiain what students 
shouid know and be abie to do. The standards are then supported by 
benchmarks, which occur as the most specific ievei of the curricuiar 
hierarchy. Let's take a iook at exampies of these different ieveis from 
the K-2 science curricuium dispiayed in FIGURE 2.3. (Sunshine State 
Standards, 2005, www.firn.edu/cloe/curric/prekl2/pclf/prk-2g.pclf). 






GRADE LEVEL; K 2 



SUBJECT AREA: Science 
STRAND: Nature of Science 



STANDARD: The student understands that aii matter has 
observabie, measurabie properties. 

BENCHMARKS: 

1 . Knows that objects con be described, ciassified and 
compared by their composition (e.g., wood or metoi) and 
their physicoi properties (e.g., coior, size, and shape). 

2. Recognizes that the some materioi can exist in different 
states 

3. Verifies that things con be done to materiais to change 
some of their physicoi properties (e.g., cutting, heating, 
and freezing), but not oii materiais respond the some way 
(e.g., heating causes water to boii and sugar to meit) 



in this figure, we note the same trend we observed in the Nationai 
Science Education Standards. Each set begins with more generai 
goais and moves to higher 
specificity. 



APPLICATION 



Examine your state science curricuium. 

Do you find this some trend— from generoiities 
to specificities— within your curricuium? 



Of course, greater specificity 
makes it easier for the teacher to 
determine what students are required 
to know and be abie to do. Such 
specificity enabies teachers to pian 
meaningfui instruction as they know 

what concepts students wiii need to understand and what abiiities 
students wiii need to deveiop. One of the authors of this pubiication 
once experienced some confusion with a state science standard 
that read, "Students wiii appreciate the diversity of iife." Because of 
the generai nature of this statement and the huge scope of scien- 
tific concepts this standard couid encompass, it was hard to know 
exactiy what content knowiedge was being required of the students! 
More specificity, as is now the trend in state and nationai curricuia, 
certainiy makes written goais easier to interpret and impiement. 



Oontent standards and benchmarks, then, heip teachers under- 
stand what students need to know and be abie to do. Teachers 
use this knowiedge to pian meaningfui instruction that wiii prepare 
students for assessments based upon these standards and bench- 
marks. However, are standards and benchmarks truiy specific 
enough? Wiii teachers compieteiy understand how they shouid 
assess and how they shouid teach by simpiy iooking at the standards 
and benchmarks for their science courses? We have often found 
that more ciarification is needed for teachers to truiy understand 
what they are required to teach. Therefore, this next section of the 






manual describes ways to "unpack" the standards and benchmarks 
so that the intent of each can be clearly understood. 



Learning Targets 

Some professional developers believe categorizing standards and 
benchmarks into learning targets can aid in this unpacking process 
(Stiggins, 2001; Regional Educational Laboratories, 1998; Butler & 
McMunn, 2006). Stiggins (2001, p. 66) describes five categories of 
such learning targets: 

• Knowledge 

• Reasoning 

• Performance Skills 

• Products 

• Disposition 



Let's take some state science competencies (standards) and objec- 
tives (benchmarks) from North Carolina's science curriculum and see 
how they can be further classified into learning target categories. 
(See FIGURE 2.4.) 



EXAAAPt.ES OF learning TARGETS 

Found in North Carolina Science 
Standards and Benchaaark's 



Knowledge. 



6^*^ GRADE: Explain the model for the interior of the earth. 

6™ GRADE: Desoribe the flow of energy and matter in natural systems. 



Reasoning 

6™ GRADE: Analyze evidenoe to explain observation, make 

inferenoes and prediotions, develop the relationship between 
evidenoe and explanation. 

6^*^ GRADE: Evaluate teohnologioal designs for applioation of soientifio 
prinoiples, risks & benefits, oonstraints of design, oonsistent 
testing protoools. 

PERFORAAANCE SiTU-LS 

6^*^ GRADE: Use oral and written language to oommunioate findings, 
defend oonolusions of soientifio investigations. 

9™ GRADE: Design and oonduot soientifio investigations to answer 
biologioal questions. 

Products 

GRADE: Using shadows, follow and reoord the apparent 
movement of the sun in the sky during the day. 

6™ GRADE: Prepare models and/or oomputer simulations to test 
hypotheses, evaluate how data fit. 

Source: North Carolina Dept, of Publio Insfruofion, Scienoe Sfandarol Course of Study 
and Grade Level Campetenoies, www.dpi.state.nc.us/curriculum/science/standard/toc.html 




KNOWLEDGE TARGETS are analogous to the lower levels of Bloom's 
Taxonomy (Recall and Comprehension). You might think of knowl- 
edge targets os "knowing the facts." You can recognize standards 
and benchmarks that would be classified as knowledge targets 
because they often start with o few key verbs: identify, list explain, 
describe, know. Knowledge targets ore Important because all of the 
higher-level thinking skills depend upon students possessing o knowl- 
edge base. Therefore, when we ask students to learn such laws/prln- 
clples as, "For every action, there Is an equal and opposite reaction" 
or "Matter Is neither created nor destroyed," we ore using knowledge 
targets. Knowledge targets also encompass such vocabulary defini- 
tions as, "matter," "photosynthesis," and "metamorphosis." 

REASONING TARGETS abound In science classes. The key verbs found In 
standards and benchmarks that denote such reasoning targets are: 
classify, categorize, compare, contrast analyze, evaluate. When we 
ask students to differentiate between eukaryotes and prokaryotes or 
to classify elements os solids, liquids, or gases at room temperatures, 
we are using reasoning targets. Writing conclusions to lob reports of 
scientific Investigations will also call for reasoning as students must 
reflect on their observations, look for patterns, form Inferences, and 
then write rationales for these Inferences. This analysis process Is defi- 
nitely o reasoning one. Problem-solving activities also fall within the 
reasoning targets category, especially when students must evaluate 
several alternative solutions and choose the "best" one. 

Students learning to operate scientific equipment, make oral pre- 
sentations, use graphing calculators or use computer applications 
are oil utilizing PERFORMANCE SKILLS TARGETS. Teachers con recognize 
standards and benchmarks that coll for performance skills when they 
see such key verbs as: demonstrate, show, operate, use, assemble, 
conduct In science class, students ore called upon to learn many 
different types of skills. These may range from such abstract skills os 
comprehending subject-specific text to more tangible, concrete skills 
as focusing a microscope. Each skill Is Important and will old students 
In becoming more scientifically literate and adept. 



Besides learning skills In science class, students also are required 
to construct products. Students may keep science journals, write 
lab reports, construct models, create Illustrations, make posters, 
etc. All of these would qualify os products and would fall under the 
PRODUCTS LEARNING TARGET category. Key verbs to look for In standards 
and benchmarks for product learning targets are: make, construct 
depict write, create, design, record. 



DISPOSITIONAL TARGETS encompass the 
"unwritten" expectations we hove for 
students. Some of these expectations 
may Include: 

• Be self-motivated. 

• Exhibit positive attitudes. 

• Work cooperatively. 

• View self os capable. 



RESOURCE 



Sample dispositional targets 
used in Georgia may be 
viewed at www.glc.k12.ga.us. 
These targets are termed 
“Character Education 
Objectives” and are required 
to be taught in all K-12 
subject areas. 



Such dispositional learning targets 
do not usually appear in the written 

objectives or benchmarks for an academic subject. These targets 
ore not usually graded; it is more common for them to be assessed 





informally. They are also more susceptible than the other achieve- 
ment targets to "teacher-pleasing" or socially acceptable responses. 
Dispositional targets are rarely entirely cognitive in nature; they are 
more motivational and affective. Just because these targets do not 
appear in curricula, however, does not mean that teachers do not 
have such goals in mind for students. 

Why Should We Classify Standards and 
Benchmarks Into Learning Target Categories? 

We have examined some science standards and benchmarks and 
then classified them into learning target categories. Why is this impor- 
tant or necessary? The primary reason for doing so is to clarify, as spe- 
cifically as possible, what we want students to know and be able to 
do. We need to understand if the standard or benchmark is asking for 
students to learn a skill, use reasoning or critical thinking, know basic 
facts, or create a quality product. 

We have already stated that increased specificity helps the teacher 
understand the learning requirements for students. The use of learn- 
ing targets simply enhances the specificity of the standard or bench- 
mark. When we examine standards and benchmarks for embedded 
learning targets, we are "unpacking" these statements and teasing 
out the "meat" or substance from them. We are using the learning 
targets to help us understand exactly what students should know 
and be able to do. In the next chapter, we will learn that knowing 
the learning targets can aid us in creating valid assessments— assess- 
ments that truly test whether a student has met a particular target. 

Unpacking the Curriculum for a Unit of Study 

Before we move to the next chapter, let's take a look at a fifth-grade 
unit of study to see how learning targets may overlap and intercon- 
nect to teach science concepts. This unit of study covers earth and 
space science concepts. The teacher, Mrs. Vega, consults the state 
standards for Mississippi and finds the following standard and bench- 
marks (Mississippi State Curriculum Frameworks, http://marcopolo.mde. 
kl2.ms.us/frameworks/science/sci_05.html): 

6. Investigate the structure of the Earth. 

a. Investigate the structure of the atmosphere (gas-air), 
hydrosphere (liquid-water), and lithosphere (solid-land). 

b. Examine how organisms affect the composition of the Earth 
and its atmosphere. 

c. Analyze processes that cause changes on Earth. 

d. Explore fossils as indicators of how life and environmental 
conditions have changed. 

FIGURE 2.5 displays a unit planning sheet Mrs. Vega developed for 
this Earth Science unit. In this planning sheet, Mrs. Vega unpacks the 
curriculum into embedded learning targets and then begins to list 
assessments that the students will do to prove they successfully met 
each learning target. She then brainstorms possible instructional 
activities that will prepare students for these assessments. By creating 
such a planning sheet, Mrs. Vega ensures that she actually addresses 
each of the learning targets embedded in her curriculum. 



FIGURE 2.5 

Unit Planning Sheet 



LEARNING 

TARGETS 


SOURCE IN 
CURRICULUM 


PCSSIBLE 

ASSESSMENTS 


POSSIBLE 

INSTRUCTIONAL 

ACTIVITIES 


KNOWLEDGE 


6a-d: All require 
some knowledge, os 
vocabulary definitions, 
components of the 
Earth and atmosphere, 
knowledge of 
processes 


• KWL sheet 

• Vocabulary 
worksheet: 
Fill-in-the-blanks 

• Vocabulary quiz: 
Matching 

• Solid, Liquid, Gas 
worksheet: Label and 
Matching 

• Journal entries 


• Textbook readings 

• Teacher lecture 

• Comprehension 
questions 

• Vocabulary wall 

• Demonstrations 

• Modeling 

• Class discussions 

• Pair shares 


REASONING 


6b: Analysis of effects 

6c: Analysis of 
processes 

6d: Analysis of how 
fossils are indicators 
and how life has 
changed 


• Short-answer 
questions 

• Concept-map rubric 

• Higher-order questions 

• Cause/effect tables 


• Brainstorming 

• Concept mapping 

• Demonstrations 

• Inquiry-based 
experiments 

• Seminars/class 
discussions 

• Before/after pictures 


PERFORMANCE 

SKILLS 


6a: Must learn skills 
needed to conduct an 
investigation (process 
skills) 


• Observation of Skills 
checklist 

• Writing hypotheses 
worksheet 


• Demonstrations 

• Inquiry-based 
experiments 

• Set class a problem to 
solve— have students 
note steps they took to 
solve it 


PRODUCTS 


6b: Composition of 
Earth model 

6d: Fossil experiment 
lab report 


• Lab Report Rubric 

• Practice lab report 

• Earth Composition 
Model Rubric 


• Work together to create 
rubrics 

• Critique products from 
previous years 


DISPOSITIONS 


None in curriculum, 
but want students 
to be able to work 
cooperatively in groups 


• Teacher observations 

• Notify rule-breakers of 
particular rule broken 


• Brainstorm rules for group 
work 

• Prioritize rules 



This planning sheet provides Mrs. Vega with a good basic unit plan. 
She now needs to create a pacing guide (e.g.. How many days will 
the unit take? What will be done each day? Which particular bench- 
marks will be covered each day?) and choose the specific activities 
(e.g.. What experiments will the students do? How will inquiry activities 
be implemented? What readings do they need to complete? What 
problem will be used to introduce performance skills in investigations?) 
and design the assessments (e.g.. What short-answer questions 
will be asked? What criteria will be used to judge the model of the 
Earth's composition?) Once these questions ore answered, Mrs. Vega 
can create her daily lesson plans feeling confident that these plans 
include instruction and assessments aligned with the learning targets 
found in the curriculum. 




As an example of moving from the unit plan fo o doily lesson plan, 
let's look of the reasoning target in FIGURE 2.5. Mrs. Vega felt that 
reasoning was embedded in three of the four benchmarks. The 
key words she observed in these were "examine," "analyze," and 
"explore." In all three benchmarks, there appeared to be cause-and- 
effect relationships, as students were asked to examine how organ- 
isms affected the composition of the Earth, to analyze processes that 
caused changes on the Earth, and to explore how fossils served as 
indicators of change. To successfully complete these benchmarks, 
students will need to understand cause and effect and be able to 
explore relationships. Mrs. Vega is going to use brainstorming activi- 
ties, graphic organizers, class discussions, and some inquiry activities 
(hands-on experimentation, student-designed labs) to help students 
make connections between cause and effect. For example, she 
might begin with benchmark 6c by showing pictures of the rock for- 
mations in Monument Valley, Utah, and asking students to brainstorm 
how the monuments were formed. Through a class sharing/discussion 
session, the word "erosion" may be introduced to the class. Once 
this word appears, Mrs. Vega may then ask how erosion occurs (e.g.. 
What are the agents of erosion?). Once such agents as wind, water, 
and frost have been identified by the students, Mrs. Vega may ask 
students to draw a "before" picture for one of the Monument Valley 
photographs, identifying the causal agent for the change. Similarly, 
the class may examine "before" and "after" pictures of Mt. St. Helens. 
Here, Mrs. Vega may ask the cause of the change seen. Through 
such demonstrations and class discussions, Mrs. Vega is introducing 
cause and effect. Later, Mrs. Vega may ask students to manipulate 
paper or stacks of colored towels to simulate mountain building. 
Again, she will draw students' attention to the cause (force applied) 
and the effect (mountain building). She may conclude this lesson 
on cause and effect by asking students to create a concept map 
around the phrase "forces that shape the Earth." 

ONE FINAL NOTE: Learning targets should be shared with students prior 
to the actual instruction or assessment of that learning target. Knowing 
the target is necessary for successful learning to occur. Think back on 
your own experiences as a student. Was there a time when you did 
not understand what you were supposed to learn? Was this a frustrat- 
ing experience for you? Or, have you ever misjudged the teacher's 
expectations? When did you find out you had made this misjudg- 
ment? Was it after you received a grade on an assignment? Could 
you have done better on the assignment if you had simply understood 
what you were supposed to learn? 

When we share the learning targets with students upfront, we are 
making it possible for them to hit the target. When we fail to share the 
learning targets, we are asking students to shoot their arrows at an 
invisible (or perhaps moving) target. 



APPLICATION 



You may practice unpacking standards and 
benchmarks into component iearning targets 
by completing the matching task in Figure 2.6. 

OR 

You may create a unit planning sheet similar 
to the one in Figure 2.5 for your next unit. 

^ 








FIGURE 2.6 

Practice in Classifying Learning Targets 



Directions: Match the standards or benchmarks on the right side to the 
correct teaming target category. 



LEARNING TARGET 
CATEGORY 



STANDARD OR BENCHMARK 



a. Knowledge 

b. Reasoning 

c. Performance Skills 

d. Products 

e. Dispositions 



1. Create a model of the cell and label all parts. 

2. Persevere to overcome roadblocks. 

3. Identify the parts of a microscope. 

4. Make a hypothesis. 

5. Analyze the effects on solubility by changing the temperature of 
the solution. 

6. Demonstrate the operation of a Bunsen burner using correct 
safety procedures. 

7. Work cooperatively in lab, sharing equipment equitably. 

8. List characteristics of living things. 

9. Write a lab report. 

10. Share scientific findings with the class in an oral report. 



Answers: l.d, 2.e, 3. a., 4.b, 5.b, 6.c, 7.e, 8. a, 9. d, lO.c 




CHAPTER THREE 




Gathering Evidence 
IN k Variety of i^ays 

METHODS OF ASSESSMENT 



In Chapter Two, we explored documents that help teachers determine 
what students should know and be able to do. Specifically, we looked 
at the National Science Education Standards and at some examples 
of state standards. We noted o trend for standards to start os general 
statements and become Increasingly specific os they ore broken Into 
objectives or benchmarks. Then, 
mented with unpacking such 
statements Into component 
learning targets: Knowledge, 

Reasoning, Performance Skills, 

Products, and Dispositions. In 
this chapter, we will empha- 
size aligning our assessments 
with these learning targets. In 
order to create valid assess- 
ments. Assessments ore valid 
when they actually provide Infoi 
motion about student achievement of 

the Intended learning target. An example of on Invalid assessment of 
the ability to use a microscope correctly (a Performance Skills learning 
target) would be to give a pencll-and-paper test on the ports of the 
microscope. A more valid assessment would be to hand the student 
a slide and hove him focus this under low and high power. Before we 
can begin creating these valid assessments, we need to look at the 
different methods of assessment available for teachers to use. 

Methods of Assessment 

FIGURE 3.1 graphically displays several different methods of assessment 
that we will reference In this chapter. As the table shows, one way 
that methods of assessment con be categorized Involves the use of 
two main headings: Selected Response and Constructed Response. 
Selected response assessments provide students with a list of possible 
answers. The students then select on appropriate answer from this list. 
Examples of Selected Response assessments Include multiple choice 
questions, true/false questions, matching questions, and some- 
times flll-ln-the-blonk questions (If students are given a word bonk of 
answers from which to choose their response). Constructed Response 
assessments, unlike selected response, require students to create their 
own answers. FIGURE 3.1 shows that constructed responses con be fur- 
ther sub-dIvIded Into Products and Performances. Short-answer ques- 
tions and essay questions would foil under the Constructed Response- 
Product subcategory. Here, students ore asked to construct their own 
responses to the question and to create a written record (a product). 
Other products listed here Include science logs or journals, research 
papers, student-constructed graphs, portfolios, notebooks, and such 
graphical organizers os flow charts or concept maps. 



we experi- 



Gather 
Evidence in 
a Variety 
of Ways 






FIGURE 3.1 

Assessment Methods and Approaches 



SELECTED RESPONSES 


CONSTRUCTED RESPONSES 




Products 


Performances 


Multiple Choice 
True/False 
Matching 
Fill-in-the-Blank 


Essays 

Logs 

Journals 

Graphing 

Portfolios 

Paragraphs (short answers) 
Notebooks 
Flow Charts 
Concept Mops 
Research Papers 


Laboratory Practical Exam 
Oral Presentations 
Demonstrations 
Dramatizations 
Role Playing 
Debates 

Panel Discussions 
Musical Recitals 
Movements 


< — Questioning — ^ 


— Formal and Informal Observations — 


4- 


— Teacher/Student Dialogues — 





A laboratory practical exam, in which students ore required to dem- 
onstrate safety procedures, might be on example of o Constructed 
Response-Performance assessment. Here, students might demon- 
strate appropriate pippetting skills, the proper way to dilute acids, or 
the safe way to heat o test tube. During these tasks, students would 
hardly be selecting their responses from o list; rather, they would be 
actively involved in demonstrating the acquired skills. Other perfor- 
mances that might occur in science classes include: 

• Oral presentations of scientific findings (e.g., students present 
results of experiments or talk to judges at the science fair 
about their projects or students utilize whiteboards and 
multimedia displays to explain findings.) 

• Dramatization of on important "moment in science" 

(e.g., the discovery of penicillin, invention of the microscope.) 

• Taking on o role to solve o problem (e.g., being the "town 
member" or the "mosquito control board member" in o 
problem related to mosquito infestation.) 

• Debates in which students support different sides of a 
question (e.g., stem cell research.) 

• Panel discussions (i.e., each student on the panel is on expert 
in o particular area. The class attempts to solve o problem by 
calling on the panel for answers.) 

• Musical recitals (e.g., students create raps or songs to teach 
scientific concepts.) 




• Movements (e.g., students create an "electron dance" to 
show quantum levels or students become codons and dance 
protein synthesis.) 



There are three other Important methods of assessment depicted on 
the chart In FIGURE 3.1. These are Questioning, Formai and Informai 
Observations, and Teaoher/Student Diaiogues. 



QUESTIONING Is perhaps the most common type of classroom assess- 
ment. Brown and Edmondson (1984) reported that as many as 100 
questions per class hour may be asked In elementary or secondary 
classrooms. For example, as teachers explain concepts, they often 
ask comprehension questions. A teacher explaining photosynthesis 

might ask at the end of the 

lesson, "What name Is given 
to the process plants used 
to make food?" Or, he may 
ask, "What are the products 
of photosynthesis?" 



RESOURCE 



“Questions Are the Answer” 
This orticie by John E. Penick, 
Undo W. Crow, and Ronoid J. 
Bonnstetter con be found in 
Science Teacher, Volume 63, 
Number 7 (January 1996) on 
pages 26-29. 



Such comprehension ques- 
tions, asking for a simple 
recall of facts comprise 
70-80% of all questions 
asked in classrooms (Borich, 

2000). If teachers only use 
questions to check compre- 
hension, they are under-utilizing this valuable assessment method. 
Questions, like other assessment methods, can stimulate student 
thinking and thereby enhance learning. Such questions are deemed 
effective questions by Borich (2000, p. 238), who states that any ques- 
tion that "evokes a response that actively engages a student in the 
learning process" is an effective question. 



APPLICATION 



Consider the following excerpt with accompanying 
comprehension (recall) questions. Answer each question. Then 
reflect: If students give correct answers, con you be sure they 
truly understand the concepts? 



It is very important that you learn about mexicathon. Mexicathon is a new 
torm ot diodine. It is tactographed in Guanocartera. The Guanocarterrans 
marborate large amounts ot pavoline and then montisorate it to 
bractorize mexicathon. Mexicathon may well be one ot our most 
symbonized chethorens in the tuture because ot our tostulotor zenorith. 

1 . What is mexicathon? 

2. Where is mexicathon tactographed? 

3. How is mexicathon broctorized? 

4. Why is it important to know about mexicathon? 

V 



Jo stimulate student thinking, questions must go beyond the 
comprehension level. Various authors categorize such higher-order 
thinking questions under different labels. For example. Bloom (1956) 
used the labels Appiication, Anaiysis, Synthesis and Evaiuation. 

In revisiting Bloom's taxonomy, Anderson,et al., (2001) use Appiy, 
Anaiyze, Evaiuate, and Create. FIGURE 3.2 uses the newer categoriza- 






tion scheme and provides sample science questions at each level. 
Note how the higher levels not only check to see if students con 
recall information, but also ask them to process information, clarify 
or expand on concepts, moke judgments, formulate generalizations, 
and/or solve problems. Such questions take students forward in their 
thinking about science concepts (and therefore enhance learning), 
while still providing the teacher with o window into students' mental 
processes (serving on assessment function). 



FIGURE 3.2 

Higher-Order Thinking Questions 
for Eighth-Grade General Science 



CATEGORY/ 
COGNITIVE LEVEL 


SAMPLE SCIENCE QUESTIONS 


REMEMBER 


1. Which planet is closest to the sun? 

2. Which planet is the largest one? 


UNDERSTAND 


1 Explain the Big Bang theory. 

2. Explain the difference between weight and mass. 


APPLY 


1 . The table on page 42 shows the distance from Earth to nine different 
stars. We know that light travels at a speed of 300,000 km per second. 
Would it take hours, days, or years for light from the Sun to reach Earth? 
From Alpha Centauri? 

2. Use your definition of gravity and of orbital velocity to answer this 
question: We are launching two satellites that will orbit Earth. One will 
orbit at a much faster rate than the other. Which one (the slower or the 
faster satellite) will we be able to place further from the surface of Earth? 


ANALYZE 


1. Difterentiate between meteors, comets, and asteroids, using at least 
three criteria. 

2. In the excerpt we read, why does the astronaut say, “Then, on the Space 
Shuttle, a goat could be our best friend?” 


EVALUATE 


1 . Of all the by-products of space exploration listed on page 93, which do 
you think is the most important? Why? 

2. Do you agree or disagree with NASA’s decision to ground the Space 
Shuttle? Defend your answer. 


CREATE 


1 . In the story we read, astronauts decided to use a goat to replace the 
waste treatment plant on the Space Shuttle. What other animals could 
be used to replace heavy or bulky equipment on the Shuttle? 

2. Design living quarters for one member of the crew on the Space Station. 



FIGURE 3.2 emphasizes the need for teachers to create questions 
ahead of time— before instruction. Higher-order questions can be dif- 
ficult to create, so teachers need time to plan these questions before 
they con implement the use of such higher-order questions in classes. 



APPLICATION 



Reflect on your own questioning skiiis. Thinking bock on your iost iesson, how 
wouid you categorize the questions you asked? How many went beyond the 
comprehension (Remember, Understand) category? For your next iesson, 
write out two questions for each cognitive levei. 

^ I / 





In addition to questioning techniques, teachers also use OBSERVATIONS 
os assessment methods. Sometimes these observations are formal 
ones; at other times they are informal. In tormo/ observations, the 
student is usually notified ahead of time and given an opportunity 
to prepare for the observation. In addition, a set of criteria may be 
shared with the student to help him in this preparation. One example 
of the use of a formal observation might be during an oral presen- 
tation. The students would have time to prepare and perhaps be 
given a list of grading criteria as "Content," "Use of Visual Aids," 
"Organization," and "Clarity." The teacher then observes the stu- 
dent's performance and rates him on these criteria. 

FIGURE 3.3 contains a sample data collection matrix for participation 
in seventh-grade class discussions of stem cell research. Following 
formal observations of students during such discussions, the teacher 
may record anecdotal notes on student performance. The teacher's 
findings can then be shared with the individual students to help them 
improve their participation in such classroom discussions. From the 
headings, it is obvious that the teacher is trying to emphasize the use 
of critical thinking in class discussions. She may be trying to enhance 
the cognitive challenge of such discussions by asking students to 
extend their knowledge beyond a recitation of facts. She wants them 
to interject inferences, justify their statements by referencing sources, 
apply knowledge, etc. 



FIGURE 3.3 

Data Collection Participation in Class Discussions 

Matrix— 7th Grade on Stem Ceii Research 



STUDENT’S 

NAME 


CONTRIBUTED 

TO 

DISCUSSION 


LISTENED 
TO OTHERS 


ARTICULATED 

TEXT 

CONCEPTS IN 
OWN WORDS 


SUBSTANTIATED 

IDEAS/ 

OPINIONS WITH 
REASONING 


MADE 

INFERENCES 


NOTES OR 
OBSERVATIONS 



















































































































Adapted from Tools for Providing Feedback in Reading: A Reading Assessment 
Handbook for All Teachers in Grades 3-12, 2004, SERVE. 



Other Sample Headings for Discussion Matrix: 

• Differentiates important from extraneous information 

• Relates text information to real life 

• Applies textual facts to a new situation 

• Makes accurate analogies 




/nformo/ observations ocour more frequently than formal ones. 
Teaohers are oonstantly watohing their students to monitor 
understanding. A teacher who notes the frown on Martha's face 
during a lecture may stop and re-explain a concept. Circulating 
during seatwork, a teacher may find that Doug is not writing units 
after his numbers. She pauses to help him. Both of these teachers 
are using informal observations as assessment methods. They are 
using observations of students to help them ascertain what students 
know and are able to do as well as using observations to monitor the 
effectiveness of their own instruction. 

In FIGURE 3.3, we saw how a formal observation instrument could be 
used to collect assessment data on student performance related to 
class discussions. Such class discussions would also fall under the cat- 
egory of Teacher/Student Dialogues. Let's look at another example 
of student/teacher interchanges in the following Classroom Scenario. 

Classroom Scenario 

In Ms. McCoy’s tenth-grade chemistry class, students 
are classifying elements as solids, liquids, or gases at 
room temperature. It is Mary's turn, and Mary has been 
assigned the element mercury, which is a liquid at room 
temperature. 

Ms. McCoy: OK, Mary, you have mercury. Is mercury a 
solid, liquid, or gas at room temperature? 

Mary: Mercury is o solid. 

(Teacher pauses and wonders if Mary has observed a 
broken mercury thermometer. When the thermometer 
breaks, the mercury liquid rolls up into small spheres that 
appear solid.) 

Ms. McCoy: Please explain the reasoning for your answer, 

Mary. 

Mary (answering with some exasperation): Mercury. Planet 
closest to the sun. Solid! 

In this scenario, we see an example of teacher/student dialogue 
used as an assessment tool. Ms. McCoy wants to understand Mary's 
answer. She is trying to diagnose the reason Mary gave an "incor- 
rect" answer. Ms. McCoy could simply have said, "Incorrect" and 
then given the right answer, "Mercury is a liquid at room tempera- 
ture." However, she paused and asked for Mary's rationale. In doing 
so, she uncovered an important misconception that Mary held. Mary 
was equating Mercury the planet with mercury the element. Without 
this conversational exchange, Ms. McCoy would not know the rea- 
son for Mary's answer. However, now that she understands Mary's 
answer, she can help correct Mary's misconception. The use of 
teacher/student dialogues, then— probing for student understand- 
ing-can be a powerful assessment method. Other venues in which 
teacher/student dialogues occur include seminars, interviews, and 
individual conversations. 



Aligning Assessment Methods 
with Learning Targets 

In this chapter, we have explored several different assessment 
methods, including selected and constructed responses, question- 
ing, observations, and teacher/student dialogues. Such assessments 
ore only valid when they actually measure what they were intended 
to measure. To foster validity, it is important to tie assessments to par- 
ticular learning targets. Let's select two sample benchmarks and then 
see if we design valid assessments for these targets. The source for 
both benchmarks is the South Carolina Science Curriculum Standards 
(www.myscschools.com/offices/cso/standarcls/science). To begin, we'll hove 
to do some unpacking: 

SAMPLE BENCHMARK 1: Name major body parts and identify the 
uses of body ports. 

KEY VERBS: Nome, identify 

EMBEDDED LEARNING TARGET: Knowledge 

This sample benchmark has knowledge as its goal. Students must 
learn the names of the body parts and be able to recall their uses. For 
example, since this is a kindergarten-level benchmark, the students 
might be expected to learn that noses ore for smelling, ears for hear- 
ing, mouths for eating, etc. What would be a good way to assess this 
benchmark? We know we have several options: selected response 
questions, constructed responses (either products or performances), 
teacher questioning, and teacher observation. FIGURE 3.4 records 
some advantages and disadvantages of each assessment method. 



FIGURE 3.4 

Body Parts Assessment 



ASSESSMENT 

METHOD 


HOW THIS 
MIGHT LOOK 


ADVANTAGES 


DISADVANTAGES 


SELECTED RESPONSE 


Multiple-choice questions 
could refer students to a 
diagram of body parts, 
while matching questions 
could ask students to 
match body parts to uses. 


Selected response 

questions can 

efficiently 

measure 

knowledge 

targets. 


Due to limited 
reading/writing 
skills of kindergarten 
students, this is not a 
good match in this 
case. Also, as with all 
selected response 
questions, there is 
the possibility of false 
positive results due to 
chance. 


CONSTRUCTED 

RESPONSE- 

PRODUCT 


Students could be asked 
to draw and correctly 
label a diagram to assess 
knowledge of names of 
body parts. They could 
write uses beside each 
body part name. 


An efficient 
method 
since teacher 
interactions with 
individuals is not 
required. 


Again, due to limited 
reading/writing 
skills of kindergarten 
students, this is not a 
good match in this 
case. 




ASSESSMENT 

METHOD 


HOW THIS 
MIGHT LOOK 


ADVANTAGES 


DISADVANTAGES 


CONSTRUCTED 

RESPONSE- 

PERFORMANCE 


Students could don on 
apron on which body 
parts could be attached 
with Velcro (as stomach, 
lungs, heart, etc.). Students 
could correctly place 
organs on the apron, tell 
their names, and tell their 
uses. 


Task is even 
more cognitively 
challenging than 
benchmark, as 
students must also 
correctly place 
the organ. 


Students would need 
time to practice with 
the apron before 
the assessment. 
Time-consuming 
assessment, as each 
student will have 
to perform for the 
teacher. 


TEACHER 
QUESTIONING 
COMBINED WITH 
TEACHER/STUDENT 
DIALOGUE 


Teacher could point to 
particular body parts on a 
diagram and ask students 
to name them. A tollow- 
up question could ask 
students to tell the use ot 
this part. 


Does not require 
reading or writing 
or even drawing. 
Should be within 
developmental 
level of students. 


Time consuming. 
Again, the teacher 
would need to 
interact with every 
student to assess 
knowledge. 


TEACHER 

OBSERVATION 


This would be similar to the 
Questioning assessment. 
The student would point 
to a diagram, name the 
parts, and give the use 
of each. However, the 
teacher would simply 
watch the student, 
without using prompting 
questions. 


Well within the 
development 
level of the 
students. 


Prompting questions 
are not present, so 
the student may 
forget to give the 
name or give the use 
of the body part. Time 
consuming, requiring 
interaction between 
the teacher and each 
student. 



As the chart shows, almost any assessment method would work for 
this benchmark. However, teachers must also take Into consideration 
the developmental level of their students. Students with limited read- 
ing or writing ability may not handle certain types of assessments 
as well os others, particularly selected response or constructed 
response-product types of assessment. (Of course, there Is no rule 
that says selected response questions must be written. The teacher 
could adopt this method and give oral choices from which the stu- 
dents may pick their answer. As long os the answer was given orally, 
this should still be well within the developmental level of kindergarten- 
ers.) Another way this could be tested Is to hove the teacher say the 
name of the body port and then have students circle the diagram 
showing this port. Since all the students could do this at their desks 
at the some time, this would not be as time consuming as perfor- 
mances, questioning, or observation methods. There Is still the prob- 
lem of giving the uses, however. 

According to the chart In FIGURE 3.4, the best assessment methods 
for this benchmark were either Constructed Response-Performance 
or Teacher Questioning combined with Teacher/Student 
Dialogue. Both ore time consuming, but both actually measure 
the knowledge embedded In the benchmark. The Constructed 
Response-Performance goes even further by stretching this minimum 
competency. 

Let's look at another benchmark, first unpacking It and then looking 
at assessment choices: 




SAMPLE BENCHMARK 2: Create and classify mixtures made of two or 
more substances (solid-solid, solid-liquid, and liquid-liquid). 

KEY VERBS: create, classify 

EMBEDDED TARGETS: Knowledge, Reasoning, Performance Skill 

Within this fifth-grade benchmark, we find at least three learning tar- 
gets embedded. Students must have some basic knowledge, as they 
must know the definitions of mixture, solid, and liquid. They will have 
to use reasoning to classify the types of mixtures. Finally, they must 
use performance skills to actually create a sample mixture of each 
type. So, what type of assessment should we use? Choose the best 
(most valid) one from the list below: 

1 ) Selected Response— Matching Question similar to this one. 

A. Solid-Solid 1. Ice water 

B. Solid-Liquid 2. Brass 

C. Liquid-Liquid 3. Oil and vinegar salad dressing 

4. Amalgam filling for a tooth 

2) Constructed Response— Product Short-Answer Question 
similar to this one. 

Give one example of each type of mixture: 

Solid-Solid 

Solid-Liquid 

Liquid-Liquid 

3) Constructed Response— Performance Task with Teacher 
Observation similar to this one. 

Using the materials in the lab, create three different mixtures. 
One must be Solid-Solid, one must be Solid-Liquid, and one 
must be Liquid-Liquid. Label your mixtures appropriately. Be 
ready to explain why these are mixtures, not compounds. 

Hopefully, you would choose # 3 above os the most valid assessment. 
The other two do not assess the Performance Skill learning target, 
even though it was embedded in the benchmark. 



APPLICATION 



Choose a benchmark from your own state science standards. “Unpack” this benchmark 
into its component ieorning targets. Then, design a voiid assessment for this benchmark. 

V y 

In this chapter, we hove concentrated on the various methods of 
assessments teachers may use to ascertain what students know and 
ore able to do. We hove particularly emphasized creating valid 
assessments— assessments that are aligned to the learning targets. 
Because the standards and benchmarks for a course may contain 
many different learning targets, a variety of assessments are needed. 

If teachers use only paper-and-pencil tests with selected response 
items, a rich source of data about student performance is missing. 

In the next chapter, we will continue looking at a variety of assess- 
ments. The emphasis in Chapter Four shifts from defining assessment 
methods to providing meaningful feedback to students. 




& kTHBR \ N & Evidence 
IN k Variety of i^ays 



MEANINGFUL FEEDBACK 



In Chapter Three, we learned that assessment methods can be cat- 
egorised as selected response or constructed response. Constructed 
responses may be further divided Into products and performances. 
Chapter Three also Introduced Questioning, Observations, and 
Teacher/Student Dialogues os methods of assessment. 



In Chapter One, we stated that assess- 
ment has two broad goals: 1) to 
provide feedback to students 
about their own learning and 
2) to monitor the effectiveness 
of Instruction. We will tackle 
monitoring the effectiveness 
of Instruction In Chapter Five. 

In this chapter, we will concen- 
trate on providing feedback 
to students about their academic 
performance, using the above 
methods of assessment. We will primarily 

concentrate on assessments other than selected response types. The 
rationale for this arises from our work with teachers over the last five 
years. It appears that teachers already feel very comfortable utilizing 
selected response Items. Teachers often have less experience, how- 
ever, with other methods of assessment. 



Gather 
Evidence in 
a Variety 
ot Ways 



Purposes of Assessment 

Reflect on your own experiences as o student. Was there o time when 
you received a low grade on an assignment because you weren't 
ready for on assessment? Perhaps you hod not had sufficient time to 
practice the skill that was being assessed. Or, perhaps you THOUGHT 
you understood o concept, only to find on the test that you were 
mistaken. Assessment experiences like these happen to all of us. 
However, os good assessors, we try to limit the number of bad assess- 
ment experiences our own students must endure. To do this, we must 
first examine the three purposes of assessment: diagnostic, formative, 
and summative. 

Teachers use DIAGNOSTIC ASSESSMENTS to ascertain the current level 
of understanding (or misunderstanding) that students possess about 
a concept prior to Instruction on this concept. The chemistry teacher 
who gives a moth test on exponents before beginning a unit on 
Avogodro's number (6.02 x 10^®) Is using diagnostic assessment. This 
chemistry teacher wonts to know If students con manipulate (odd, 
subtract, multiply, and divide) exponents before beginning o unit that 
will require them to do so. Diagnostic assessments reveal "where the 
students are now" so that teachers con plan appropriate pathways 
that will lead the students to deeper levels of understanding. Such 
assessments ore also useful In uncovering student misconceptions 
about science topics. Therefore, diagnostic assessments are power- 





ful assessment tools that help teochers design instruction that meets 
students' needs. 

FORMATIVE ASSESSMENTS are used to provide feedback to students as 
they progress toward a learning goal. Through formative assessments, 
students learn about their strengths and weaknesses before "it counts," 
i.e., before a grade is taken or recorded. Feedback during practice 
is the heart of formative assessment. For example, a sixth-grade stu- 
dent may submit the first draft of a graph for the teacher's review. The 
teacher makes comments on it or discusses the positive and negative 
features of the graph with the student. The student is then given an 
opportunity to re-draw or correct the graph before a grade is taken. 
Similarly, the teacher may provide a list of criteria to students. These 
criteria will be used to score a performance task (perhaps an oral pre- 
sentation on "Animal Adaptations to a Particular Environment"). The 
students can practice the task, score themselves, and then ask peers 
to score them before performing the task for a grade. 

In order for students to benefit from formative feedback. Black & 
Wiliam (1998, p.l43) list the following three key features of effective 
feedback. "When anyone is trying to learn, feedback about the 
effort has three elements: recognition of the desired goal, evidence 
about present position, and some understanding of a way to close 
the gap between the two." Therefore, it is important for the teacher 
to convey his/her expectations to the students, provide evidence of 
their present level of performance, and then suggest strategies to 
improve that performance. 

SUMMATIVE ASSESSMENTS are usually culminating ones. They occur after 
students have had time to practice a new skill, ask questions, interact 
with materials, etc. Summative assessments are generally used to rate 
the proficiency of students relative to a particular skill or skill set (as 
"proficient and ready to move on" to "beginning to understand, but 
needs further instruction") or they may be used to rank order students 
(as from "best" to "worst" or from "got it" to "hasn't a duel"). 

In this chapter, we will focus on formative assessments. We will 
look at ways to provide meaningful feedback to science students 
while they are learning. In this formative process, we aid students 
in becoming competent self-assessors (since grading criteria are 
shared with them ahead of time), and we help them improve their 
performances (because they receive meaningful and timely feed- 
back on those performances). 

Types of Formative Feedback 

Formative feedback is that feedback provided to students while 
they are learning. The purposes of such feedback include: 1) con- 
veying the desired learning goal or goals to students, 2) alerting 
students to misconceptions, 3) providing them with information on 
strengths and weaknesses of their performances, and 4) providing 
strategies students can use to address weaknesses. Figure 4.1 dis- 
plays the assessment methods introduced in Chapter Three and then 
suggests some feedback mechanisms appropriate for each. For the 
remainder of this chapter, we will explore examples of these feed- 
back mechanisms. 



FIGURE 4.1 

Feedback Mechanisms Matched to 
Assessment Methods 



ASSESSMENT METHOD 


FEEDBACK MECHANISM 


SELECTED RESPONSE 


Written feedback: marking incorrect answers with circie 
or X; checkmarks for correct answers; Verbai feedback 
(foiiow-up conferences/discussions with students to review 
performance on seiected response items) 


CONSTRUCTED RESPONSE- 
PRODUCTS 


Verbai feedback (discussions/conferences with students); 
Written feedback via comments or scoring rubrics 


CONSTRUCTED RESPONSE- 
PERFORMANCES 


Verbai feedback (discussions/conferences with students); 
Written feedback via comments or scoring rubrics 


QUESTIONING 


Verbai feedback (indicating correctness of answer; foiiow- 
up questions to stimuiate thinking) 


OBSERVATIONS— INFORMAL 


Verbai feedback (via discussions/conferences with 
students; comments pinpointing weaknesses and strengths; 
suggestions for improving work) 


OBSERVATIONS— FORMAL 


Verbai feedback (via discussions/conferences with 
students); written feedback via comments, markings on 
observation instruments, or scoring rubrics 


TEACHER/STUDENT 

DIALOGUES 


Verbai feedback via comments highiighting strengths and 
weaknesses; suggestions for improving work 



Verbal Feedback 

Perhaps the most common method by which students receive feed- 
back is via verboi comments from the teacher. This type of feedback 
is often used with Questioning, Teacher/Student Dioiogues, and 
informoi Observations (oithough it is possibie to use it with aii kinds of 
assessments). A simpie "Yes" or "That's correct" statement mode by 
the teacher after o student answers o question may signai to the stu- 
dent that he is on the right track. Teacher comments after on obser- 
vation, as "Trung, you need to hoid the streaking wand iightiy in your 
hand and use iess pressure when streaking the agar piate. This wiii 
prevent cuts in the ogor," con aiso provide students with feedback on 
performances os weii os guidance for addressing weaknesses. To see 
how verbai feedback might be used in Teacher/Student Dioiogues, 
iet's return to the sampie dioiogue from Chapter Three. This time, we'ii 
continue the conversation to see how Ms. McCoy provided verbai 
feedback to Mary. 



Classroom Scenario 

In Ms. McCoy’s tenth-grade chemistry class, students 
are classifying elements as solids, liquids, or gases at 
room temperature. If is Mary’s turn, and Mary has been 
assigned the element mercury, which is a liquid at room 
temperature. 

Ms. McCoy: OK, Mary, you have mercury. Is mercury a 
solid, liquid, or gas at room temperature? 




Mary: Mercury is a solid. 

(Teacher pauses and wonders if Mary has observed a 
broken mercury thermometer. When the thermometer 
breaks, the mercury liquid rolls up into small spheres that 
appear solid.) 

Ms. McCoy: Please explain the reasoning for your answer, 
Mary. 

Mary (answering with some exasperation): Mercury. Planet 
closest to the sun. Solid! 



Verbal Feedback 

Ms. McCoy: Thanks, Mary. Now I understand your 
answer. You’re right— Mercury is the planet closest to the 
sun. However, the planet Mercury is not nnade up of the 
elennent “nnercury.” I know this is confusing, since both 
hove the sonne nonne. So, let nne give you and the class a 
little history lesson. Both the planet and the elennent ore 
nonned after one of the Greek gods. This nnythologicol god 
(whose nonne was Mercury) served os the nnessenger for 
the other gods. They would send hinn down to Earth to talk 
with nnortols, or send him with messages to other gods. 

The reason Mercury was chosen to be the messenger was 
because he possessed sandals with attached wings. This 
mode Mercury very fast. So, when the gods wanted to send 
a message, they naturally chose the fastest god os their 
messenger. When Mercury, the planet, was discovered 
by astronomers, they were struck by how fast the planet 
moved around the sun. Therefore, they chose to name 
this planet after the messenger of the gods. Mercury, the 
element, also got its name from being fast. In fact, the old 
name for this element was “quick-silver.” Obviously, this 
element could move very fast at room temperature. Do 
you now understand that, even though both ore named for 
the some mythological god, the planet Mercury and the 
element mercury ore two different things? 

Mary: Yes. I just thought that the planet hod to be mode up 
of the element, since both hod the some name. Now, I see 
that they ore just named after the some person, but they 
ore two different things. 

Ms. McCoy: Good, Mary. Are you ready to answer the 
question, then, about mercury, the element? Would it be a 
solid, liquid, or gas at room temperature? 

Mary: You know. I’m still stumped. I just don’t know which 
answer to choose. 

Ms. McCoy: Then, let’s review what we know about the 
Periodic Table of the Elements. How does the table in our 
book show us the different states of matter? 

Mary: Oh, I remember! So!ids ore printed in block ink, 
liquids ore in blue ink, and gases ore in red ink. 
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Ms. McCoy: Absolutely correct, Mary. So, at room temper- 
ature, what Is the state of matter of the element mercury? 

Mary: Liquid! 

Ms. McCoy: That’s right, Mary. Mercury Is a liquid at room 
temperature. We know this because our chart shows It In 
blue Ink. If you look closely, you’ll see that It Is the only 
metal that Is a liquid at room temperature. 

In this interchange, Ms. McCoy provides feedback about Mary's 
answers, initiaiiy, Mary gives the wrong answer, due to a misconcep- 
tion. Mary is confusing Mercury the pianet with mercury the eiement. 
By expanding the Teacher/Student diaiogue, Ms. McCoy gives Mary 
(and the ciass) some background knowiedge about the eiement. 

She aiso carefuiiy heips Mary get back on track with some skiiifui 
reminders and questions (e.g.. How does the tabie in our book show 
us the different states of matter?). Once Mary is heiped to the cor- 
rect answer, Ms. McCoy provides positive feedback (e.g., "Absoiuteiy 
correct. That's right") and then restates the right answer for Mary and 
the ciass (e.g., "Mercury is a iiquid at room temperature"). Finaiiy, Ms. 
McCoy provides some additionai information that may heip the ciass 
particuiariy remember the eiement mercury (e.g., "it is the oniy metai 
that is a iiquid at room temperature"). 

in this ciassroom scenario, Ms. McCoy fuifiiis aii of the Biack and 
Wiiiam (1998) requirements for feedback. She introduces the desired 
goal (students shouid be abie to interpret the Periodic Tabie to ascer- 
tain which eiements are iiquids, soiids, or gases at room temperature), 
and she provides information about Mary's present position reiative to 
this goai through her verbai feedback, in the above interchange with 
Mary, we aiso see how Ms. McCoy reminds Mary of the strategy (way 
to close the gap) to use in formuiating her answer (using the iegend 
on the Periodic Tabie). Finaiiy, Ms. McCoy aiso heips uncover a stu- 
dent misconception (the pianet Mercury is made of mercury). 

Written Feedback 

Often, teachers use written feedback mechanisms when seiected 
response and constructed response (product or performance) 
assessments are impiemented with students. On seiected response 
items, this written feedback is most iikeiy to take the form of marking. 
As the chart in FIGURE 4.1 indicates, these markings may take the form 
of circies, X marks, or checkmarks. These symbois provide a short- 
hand method of indicating correct or incorrect answers. Such marks 
do iittie, however, to heip students ciose the gap. 

Written feedback on products may aiso utiiize marking symbois, but 
words or phrases are usuaiiy added to the paper to indicate stu- 
dent strengths or weaknesses or to provide guidance to students for 
improving performance. FIGURE 4.2 displays an example of written 
feedback on an eighth-grade product assessment. In this case, the 
product is a short-answer, recall question given to students before an 
exam. The teacher is preparing the students for several knowledge- 
level questions that will be asked on a multiple-choice, large-scale 
test. By asking the students to provide a complete explanation of 
blood circulation, she can pinpoint weaknesses in their understand- 
ing. In this manner, students can correct misconceptions or identify 
areas for further study before the exam. 



I^RITTEN Feedback' to a 

SHORT'ANSi^ER dlUESTION 



FIGURE 4.2 



QUESTION: Trace the path of a single red blood cell in the human circulatory 
system. Start at the aorta. Note when the blood is oxygen-rich and when it is 
oxygen poor. 

STUDENT ANSWER (with teacher feedback in green): r 
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In the written feedback, the teacher indicates the strengths and 
weaknesses found within the student's answer. She appears to use 
"Correct" to provide positive feedback and questions to indicate 
weaknesses. She ends her comments with suggestions about how 
Keisha con improve her performance on this subject (blood circula- 
tion) and even challenges Keisha to perform further research/study 
("When will this saying not be true?"). 

In this example of written feedback, then, the teacher identifies the 
desired goal (students will be able to trace the course of a red blood 
cell throughout the circulatory system), provides information on stu- 
dent strengths and weaknesses, and suggests strategies for address- 
ing weaknesses. This particular excerpt does not indicate any miscon- 
ceptions found in Keisho's thinking. It appears that Keisha has a basic 
understanding of the circulatory system but needs to work on the 
details. Knowing the names of particular blood vessels (as the vena 
cavae), the parts of the heart (right and left atria, right and left ven- 
tricles), and the functions of each component part of the circulatory 
system will prepare Keisha for a multitude of highly specific, knowl- 
edge-level, multiple-choice questions on this subject. In this manner, 
the short-answer question is formative in nature, preparing Keisha for 
a summative assessment that will come later. The feedback is forma- 
tive as it occurs while Keisha is still learning and because it fulfills three 
of the four purposes of such feedback. 

A Subset of Written Feedback: Scoring Rubrics 

Scoring rubrics are a special subset of written feedback. Rubrics 
generally provide a list of criteria supported by descriptions of perfor- 
mance levels relative to these criteria. FIGURE 4.3 provides an excerpt 
from such a scoring rubric, showing one criterion with its accompa- 
nying performance descriptions. This rubric might be used in a high 
school biology class. In this class, the teacher uses o machine to sim- 
ulate the heart sounds that would be heard through a stethoscope 
during o blood pressure reading. Students must identify when to take 
readings by listening to the heart sounds. 



FIGURE 4.3 

Excerpt from Rubric Used to Assess Student Skill 
in Taking Blood Pressure 



CRITERIA 


EXCELLENT 


ADEQUATE 


NEEDS IMPROVEMENT 


ACCURACY 


Student readings for 
systolic and diastolic 
pressure are within 
3 mmHg (+ or-) of 
machine setting 


student readings for 
systolic and diastolic 
pressure are within 
7 mmHg (+ or-) of 
machine setting 


Student readings for systolic 
and diastolic pressure 
are greater than 7 mmHg 
(+ or -) compared to the 
machine setting 



This rubric provides feedback to students by stating the desired goal 
(on acceptable performance is to be within 7 mmHg of the machine 
reading) and by providing information about student performances 
(students con score at the "excellent," "adequate," or "needs improve- 
ment" level). This particular excerpt from the rubric does not provide 
information on strategies students con use to close the gap between 
"excellent" and "needs improvement" performances, however. 




There are distinct advantages to using rubrics for formative feed- 
back. When rubrics are distributed to students whiie they are iearn- 
ing, the rubrics can be very instructionai in nature. Because such 
rubrics iist the criteria by which students wiii be judged and then 
provide descriptions of performances at different ieveis, students can 
use them to hone their own performances. Therefore, rubrics inform 
students of the desired goais and provide mechanisms by which 
students can seif-assess (gain information on their present position). 
Rubrics heip students pian their performances, as they provide ciues 
as to what a quaiity performance wiii inciude. These ciues can often 
be used to brainstorm strategies for improving performance. 

By using o rubric, the teacher provides the students with her expec- 
tations. She defines and describes a high-quaiity performance and 
iists the criteria by which the students wiii be judged. She distributes 
the rubric before students begin work. Then, she asks the students to 
perform peer and seif-assessments of the work, using the rubric as a 
guide. Finaiiy, she provides feedback on the student work by scor- 
ing this with the rubric. Aii of these activities (peer, seif, and teacher 
assessment) can be formative— they can take piace during the iearn- 
ing process. Therefore, the rubric serves as a feedback tooi during this 
formative process. However, once students have compieted the peer, 
seif, and teacher formative assessments, the same rubric can then 
be used to score a summative assessment, in this manner, the same 
rubric may be used both formativeiy and summativeiy. in this chapter, 
we wiii emphasize the formative use of rubrics, however. We are draw- 
ing a distinction between "scoring" an assignment and "grading" an 
assignment. For the purposes of this text, "scoring" wiii mean giving for- 
mative feedback for improvement during instruction, whereas "grad- 
ing" connotes a summative function that occurs after instruction. 

Whether used for formative purposes or summative ones, rubrics may 
be holistic or analytical In nature, it is usuaiiy very easy to distinguish 
these two types. Students wiii receive oniy one score when a hoiistic 
rubric is used, but wiii receive muitipie scores within severai different 
dimensions when an anaiyticai rubric is used. See FIGURE 4.4 for sam- 
pie hoiistic and anaiyticai rubrics. 



FIGURE 4.4 
Types Of Rubrics 



Holistic Rubric for Seventh-Grade Writing in Science 



SCORE POINTS AND LEVEL 


DESCRIPTION 


6: 

HIGHEST LEVEL 


Main ideas and supporting detaiis are entirety accurate, specific, 
ciear, and sufficient; Higher-order thinking highiy deveioped; 

Strong organizationai structures throughout; Strong use of ianguage 
throughout; Diagrams or sketches are accurate, ciear, and focused 
on the topic 


4: 

SOMEWHAT DEVELOPED 
RESPONSES 


Main ideas and supporting detaiis are mostiy accurate and 
mostiy specific; Higher-order thinking deveioped; Satisfactory 
organizationai structures; Language used in a satisfactory manner; 
Diagrams or sketches are on the topic 


2: 

UNDEVELOPED 

RESPONSE 


Main idea with no support and/or contains major errors in using the 
main science concepts important to the task; Very iittie evidence 
of higher-order thinking; Unorganized; Some ianguage used 
incorrectiy; Diagrams or sketches are off topic or absent 


0: 

LOWEST LEVEL 


No response 



Analytical Rubric for Seventh-Grade Writing 
in Science 



CRITERIA 


LEVEL 1— LOW 


LEVEL 2 


LEVEL 3 


LEVEL 4— HIGH 


MAIN IDEAS 


Not present 


Present but 
not ciear; May 
inciude major 
errors 


Ciear and 
accurate; May 
inciude minor 
errors 


Ciear, accurate, 
and sufficient in 
number to cover 
the topic 


SUPPORTING 

DETAILS 


Not accurate 


Accurate but 
very sparse; May 
inciude major 
errors 


Accurate and 
sufficient in 
number to 
support the 
main ideas; May 
inciude minor 
errors 


Accurate, sufficient 
in number, and 
especialiy weii 
chosen to support 
the main idea 


HIGHER-ORDER 

THINKING 


Not evident 


Somewhat 

evident 


Evident 


Strongiy evident 


ORGANIZATION 


Very unorganized 


Somewhat 

organized 


Organized 


Strongiy organized 
throughout 


USE OF 
LANGUAGE 


immature use of 
vocabuiary 


Vocabuiary 
inciudes some 
topic-specific 
words 


Vocabuiary 
inciudes a good 
seiection of 
topic-specific 
words 


Vocabuiary 
chosen does an 
exceiient job of 
addressing both 
the topic and the 
audience 


SKETCHES 


No sketches 


Sketches present 
but do not 
support the 
writing weii 


Sketches 
accurate and 
do support the 
writing 


Sketches accurate 
and especiaiiy 
ciear in supporting 
the writing 



Adapted from: Hibbard, K. M. ( 2000). Performance-based learning and assessment in 
middie school science. Larchmont NY: Eye on Education. Pages 70-71, 74. 





From examining these two rubrics, you can see that the student 
wouid receive oniy one score in the first, hoiistic rubric. She wouid 
get either 6 or 4 or 2 or 0. However, in the second, anaiyticai rubric, 
the student wouid get a score in six different dimensions (Main idea. 
Supporting Detaiis, Higher-Order Thinking, Organization, Use of 
Language, and Sketches). 

Let us suppose that a seventh-grade student, Marty, submitted a 
sampie of scientific writing, in this piece, Marty covered the main 
idea very weii, but was weak in supporting detaiis. He showed excep- 
tionai higher-order thinking, as he reiated the science concept to 
his own iife and extrapoiated possibie future uses of this concept. 
However, his organization tended to skip from subject to subject and 
back again, negativeiy impacting his ciarity. His word choices were 
high ievei. Marty forgot to inciude a sketch. How wouid Marty do on 
each of the rubrics? 

Perhaps the teacher using the hoiistic rubric gave Marty an overaii 4 
due to the poor organization and iack of sketch. The teacher using 
the anaiyticai rubric recorded the foiiowing for Marty: 



Main idea 4 

Supporting Detaiis 2 

Higher-Order Thinking 4 
Organization 3 

Use of Language 4 

Sketches 1 



Which type of rubric wouid provide the most meaningfui formative 
feedback to Marty, to heip him improve his scientific writing perfor- 
mance? Oertainiy it wouid be the anaiyticai one, which cieariy shows 
Marty's strengths and weaknesses. This anaiyticai rubric couid greatiy 
aid Marty in seif-assessing his own work by informing him of targets, 
indicating his present position reiative to the targets, and suggest- 
ing ways to improve his performance (via performance indicators on 
the rubric, as comparing the descriptions of performance ieveis for 
Supporting Detaiis). By reading the rubric, Marty couid find that he 
needs to provide more supporting detaiis (his were sparse) and that 
these detaiis must support his main idea. The rubric aiso notifies Marty 
of the necessity of enciosing a sketch with his work. Marty can use 
this information to improve his performance before a grade is taken 
on his writing in science. Since the anaiyticai rubric provides greater 
detaiis on student performance (giving a score in severai different 
dimensions to heip students pinpoint strengths and weaknesses), we 
recommend the use of analytical rubrics with formative assessments. 

Creating High-Quality Analytical Rubrics 

Anyone who has attempted to create a rubric from "scratch" has 
probabiy found that this is no easy task, it is not unusuai for teach- 
ers creating rubrics to overiook an important criterion and oniy reai- 
ize they did so when they use the rubric to score student work. For 
exampie, one teacher created a rubric for assessing student posters 
on types of muscies. Her criteria inciuded: Ciarity, Use of Coior, Size, 
and Depiction of Three Types. Oniy when she used the rubric to score 
student posters did she reaiize she had not inciuded Accuracy as a 
criterion. Therefore, if students iabeied the types of muscie incorrectiy 
(perhaps iabeiing cardiac muscie as skeietai muscie), nowhere in her 



rubric could she count off for this! To avoid such embarrassing experi- 
ences, we recommend that you create new rubrics while examining 
student work from past years. One way to begin this process is to sort 
the work into piles labeled "Low, Medium, and High." Then, by look- 
ing at the differences in the work, you con begin to construct o list of 
important criteria. After you hove the criteria, you con write descrip- 
tions for the different proficiency levels. 

Another excellent way to create rubrics is to involve students in the 
development process. One reason for doing so is that many minds 
ore better than one: Students may think of criteria that the teacher 
would miss. Another important reason for involving students in rubric 
creation is simply because of the educative value of this process. By 
discussing the criteria for quality work, students will get o clearer idea 
of the teacher's expectations. Whether rubrics ore teacher gener- 
ated or student constructed, it is imperative to keep the learning 
target to the forefront, so that the rubric accurately reflects what stu- 
dents are being asked to accomplish. 

FIGURE 4.5 has a confusing name, but it may help you understand the 
important characteristics of a high-quality rubric. It is a rubric that con 
be used to judge the quality of an analytical rubric that you create 
for use in your classroom. If you score your teacher-constructed class- 
room rubric using FIGURE 4.5, you may find weaknesses in the class- 
room rubric that you can correct before you disseminate it to students. 
By using FIGURE 4.5, you may enhance the quality of the rubric. 



FIGURE 4.5 

A Rubric for Assessing the Quality 
of on Analytical Rubric 





CRITERIA 




Excellent (5) 


Acceptable (3) 


Unacceptable (1) 


f 


Criteria identify aii 
components necessary for 
high-quaiity work. 


Sufficient criteria are present 
to define high-quaiity work, 
but some “fine-tuning” criteria 
may be missing. 


Rubric faiis to inciude criticai 
criteria for exceiience (eg., 
content accuracy). 


2 


Criteria reflect aiignment to 
aii stated objectives. 


Minor objectives are not 
sufficiency addressed by the 
criteria. 


Major objectives are not 
sufficiency addressed by the 
criteria. 


3 


Criteria are cleariy feasibie 
and measurabie. 


Criteria are measurabie and 
reasonabiy feasibie. 


Criteria iisted in the rubric may 
not be measurable, or they 
have low feasibility (e.g., they 
would require the teacher to 
individually observe each 
student, thereby using more 
time than warranted by the 
importance of the criterion). 



FIGURE 4.5 (continued) 



DESCRIPTORS 




Excellent (5) 


Acceptable (3) 


Unacceptable (1) 


Descriptors always use 
“kid-friendly” language, 
language easily 
understood by the target 
student population. 


Descriptors usually use “kid- 
friendly language,” but some 
“educationalese” is present. 


Descriptors consistently use 
language inappropriate for the 
developmental/academic/ 
grade level of the student 
population. 


Descriptors use observable 
characteristics and avoid 
overuse of vague quanti- 
tative terms (e.g., several, 
numerous, some). 


Descriptors rely more on 
quantitative terms than on 
observable characteristics. 


The use of too many generalities 
(as sometimes, occasionally, 
some, etc.) detracts from the 
clarity and leads to confusion. 


Descriptors accurately and 
specifically describe levels 
of performance within the 
criterion. 


Descriptors accurately 
describe levels of perfor- 
mance, but specificity may 
be low (e.g., for a writing 
rubric, the descriptors might 
say the student “uses stan- 
dard English.” This does not 
specifically explain what 
“standard English” means). 


Descriptors and criteria do 
not match OR there are no 
descriptors OR descriptors use 
one word (e.g., “always” for 
best performance, “sometimes” 
for acceptable performance, 
and “never” for unacceptable 
performance). 



SCORING GUIDE 





Excellent (5) 


Acceptable (3) 


Unacceptable (1) 


The most important criteria 
are allotted the most 
number of points, thereby 
allowing students to 
prioritize efforts. 


All criteria earn the same 
amount of points, as all are 
equally important. 


All criteria earn the same amount 
of points, but some are clearly 
more important or more aligned 
to the objectives than others. 


The scoring guide clearly 
demonstrates how rubric 
scores will be converted to 
grades. 


A conversion scale is present, 
but may not address all 
grading levels (e.g. “10 points 
is an A” is stated, but no 
distinctions are given for A+ 
grades or A- grades). 


No conversion scale is present. 



FORMAT (TYPE) OF RUBRIC 



Excellent (5) 



Type of rubric used sup- 
ports purpose of assess- 
ment (e.g., on analytical, 
task-specific rubric for a 
formative assessment; a 
holistic, generalized rubric 
for a summative assess- 
ment). 



Acceptable (3) 

Type of rubric used is appro- 
priate (e.g., analytical for 
formative assessments and 
holistic only for summative 
assessments), but general- 
ized may be substituted for 
task-specific rubrics if students 
will perform several iterations 
of this type of work through- 
out the school year (e.g., an 
“oral presentation” rubric). 



Unacceptable (1) 

Type of rubric does not support 
purpose of assessment (e.g., a 
holistic rubric is used to judge 
performance on a newly taught 
skill). 







QUALITY OF THE FEEDBACK 




Excellent (5) 


Acceptable (3) 


Unacceptable (1) 


lo 


Rubric scores clearly 
pinpoint student’s 
strengths and weaknesses; 
student obtains sufficient 
information on strategies 
that can be used to 
improve performance. 


Rubric scores clearly pinpoint 
student’s strengths and 
weaknesses and provide 
some information on 
improvement strategies. 


Rubric gives insufficient informa- 
tion to allow student to improve. 


II 


Rubric is instructional in 
nature; students can read 
and understand what 
high-quality work will look 
like; students can use 
rubric to self-assess own 
performance without input 
from the teacher. 


Rubric is instructional in 
nature; students can read 
and understand what high- 
quality work will look like. 


Rubric does not contain sufficient 
information for students to under- 
stand teacher expectations and/ 
or what quality work will look like. 



Sample Rubrics 



APPLICATION 



In this final section of Chapter Four, we 
have included sample rubrics for several 
tasks typically found in science classrooms. 

The rubrics we hove chosen are ones that 
are analytical in nature, in hopes that 
they will be used to provide meaningful 
feedback to students during the learning 

process. The rubrics are designed for use in o high school classroom. 
Each rubric also includes a sample score conversion chart. It should 
be emphasized, however, that the purpose of providing such scores 
is informational, rather than evaluative in nature. The scores show the 
students the proficiency level at which they are currently working. They 
allow the students to predict a grade based on current performance, 
in hopes of encouraging improvement in this performance. Since 
the rubrics included here ore formative in nature and because they 
are being used to provide feedback to students during the iearning 
process, grades would not actually be recorded. These same rubrics 
could be used summatively, however, otter students ore given suffi- 
cient practice time and quality feedback on their practice attempts. 



Select a rubric you use in your science 
classroom. Score this rubric using 
Figure 4.5. Redesign the rubric based 
upon your findings. 



Cautionary notes: The following rubrics are samples only. They are 
not ready-made rubrics available for instant use in your classroom. 
Teachers should use caution in using these or any other published 
rubrics. Rubrics should always be carefully evaluated for alignment 
with the curriculum being used. Since different states hove different 
standards, it is important for the readers of this manual to understand 
that the sample rubrics included here may provide a better fit for 
some standards than for others. 



Each sample rubric is preceded by o brief explanation of the task stu- 
dents are being asked to perform. Connections to content standards 
from the National Science Education Standards are also provided 
for each rubric. Before incorporating one of these sample rubrics in 
your own classroom, you should first evaluate it using Figure 4.5. If 
the rubric does not align with the standards used in your classroom, 
it will need to be revised before you utilize it. If the greatest number 
of points ore awarded to criteria that are of little value in your cur- 
riculum, the point values should also be adjusted. Finally, the sample 





rubrics all use a grade of "C" to indicate mastery of the task. If you 
wish to set a higher grade for mastery, you may need to indicate this 
in the score conversion chart by having students resubmit work below 
your mastery level. 

The sample rubrics, like all rubrics, are not meant to be used as 
stand-alone instructional tools for students. While these rubrics are 
instructional in that they provide lists of criteria and descriptions of 
performance levels, students will need support in understanding 
the descriptions and in utilizing the rubrics. When introducing a new 
rubric to the class (particularly one that was primarily teacher- 
generated without input from students), the following suggestions 
may be helpful: 

1) Set aside time within class for students to read through 
the rubric. 

2) Highlight the learning target to be scored with this rubric. 

3) Review each criterion, with its accompanying performance 
level descriptions with the students. During this review, ask 
students to put the descriptions in their own words and ask 
them to define unfamiliar vocabulary. If they are unable to 
perform these tasks, more teacher explanations about the 
rubric are needed. 

4) Provide a sample (or samples) of student work that students 
can score by using the rubric. Have students share their scores 
and explain why particular scores were given, using "rubric 
language" (language from the rubric) to justify the scores. 

5) Work together with students to revise the rubric. Change lan- 
guage to be more kid-friendly, based upon student recom- 
mendations. Add or delete criteria as needed to more closely 
align the rubric with the learning target. 

6) When using the rubric to score formative assignments, don't 
hesitate to add written comments. Don't feel constrained to 
simply circle particular scores for each dimension or to high- 
light rubric phrases. Teacher comments may help students 
understand strategies they can use to improve performance. 



APPLICATION 



Use the “Rubric for Assessing the Quaiity of on Anaiyticoi Rubric” 
found in Figure 4.5 to evaiuofe one or more of the sompie rubrics 
found in Figures 4. 6-4. 9. What ore the strengths and weaknesses 
of these rubrics? 

\ 2 

With the above cautions in mind, review the sample rubrics with a 
critical eye. Use the following Application suggestions to enhance 
the utility of each rubric in your science class. 




APPMCATION 



For each rubric: 

1 . Rewrite, using icngucge that students at the ievei you teach 
wouid understand (“kid ianguage”) instead of the high 
schooi ianguage incorporated in these rubrics. 

2. Revise the rubric as needed to aiign with your curricuium. 

3. Check the rubric to ensure that the score conversion chart 
accurateiy portrays mastery leveis. (For exampie, in the 
iaboratory report rubric, Figure 4.9, wouid 34 points— the 
“C” ievei— correspond to mastery? if not, revise the score 
converstion chart.) 

4. Test the rubric by using it with your students. 

5. Rewrite the rubric to address any shortcomings you found 
upon impiementotion or to moke the rubric more reievont to 
your classroom. 

6. Check the final rubric to ensure that it fulfills the purposes of 
formative feedback which include: a) conveying the desired 
learning goal or goals to students, b) alerting students to 
misconceptions, c) providing them with information on 
strengths and weaknesses of their performances, and d) 
providing strategies students con use to address weaknesses. 

V / 

Designing an Experiment 

In science, students are often asked to 
design on experiment in order to answer 
a question or to solve o problem. The 
rubric displayed in Figure 4.6 may be 
used to provide feedback on this design 
process. This rubric might be used when 
students ore given "mystery powders" as 
well as several testing solutions and then 
asked to create o process for identifying 
each powder. With the modification sug- 
gested at the bottom, this rubric could 
also be adapted to provide feedback 
on such engineering design tasks as cre- 
ating a "cradle" that will protect the egg 
from breaking in the classic "egg drop" 
experiment. 

This rubric may be used to assess the following in grades 9-12 
(Notional Research Council, 1996, p. 175): 

• Identify questions and concepts that guide scientific 
investigations. 

• Design and conduct scientific investigations. 



RESOURCES 



Websites for Rubrics: 

• www.accessexcellence.org 

• http://rubistar.4teachers.org/index.php 

• www.rubrics4teachers.com 

• http://intranet.cps.kl 2. il.us/Assessments/ 
ldeas_and_Rubrics/Rubric_Bank/ 
rubric_bank.html 

V 2 
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Score Conversion (without Engineering Design criterion) 

48-54 -> A 40-47 -> B 30-39 -> C Below 30 -> Resubmit 





Presenting Scientific Findings 
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RELEVANT SCIENTIFIC PROBLEM 

CONCEPTS STATEMENT 



Writing a Lab Report 

Another common task given to science students is to write a iabora- 
tory report of an experiment. The rubric shown in Figure 4.9 wouid 
provide meaningfui feedback to high schooi students writing such 
reports. 

This rubric addresses parts of the foiiowing (Nationai Research 
Council 1996 pp. 174-176): 

• Design and conduct scientific investigation. 

• Use technoiogy and mathematics to improve investigations 
and communications. 

• Formuiate and revise scientific expianations and modeis 
using iogic and evidence. 

• Communicate and defend a scientific argument. 



RESOURCE 



The following website may 
be useful for high school 
teachers. This “Lob Write” site 
is on interactive one for 
teachers and students. It 
provides instruction in 
writing lob reports and offers 
examples of such reports: 
http://labwrite.ncsu.edu 



CRITERIA EXEMPLARY ADEQUATE RESUBMIT 



5 points 


3 points 


1 point 


Problem is stated as 
a question. Problem 
is appropriate to 
assigned task. 


Problem is clearly 
identified but may 
not be in the form of a 
question, and problem is 
appropriate to assigned 
task. 


Inappropriate problem 
or problem that is 
insufficiently identified/ 
explained. 


7 points 


5 points 


2 points 


Student explains all 
scientific concepts 
(correctly using scientific 
terms/vocabulary) 
relevant to the problem, 
explaining why these are 
relevant. Student provides 
references for the 
background information 
from text or reference 
materials. 


Student explains several 
scientific concepts (cor- 
rectly using scientific 
terms/vocabulary) rel- 
evant to the problem, but 
may fail to include others. 
Student explains why the 
concepts presented are 
relevant, and student pro- 
vides references from text 
or reference materials. 


Scientific concepts are 
missing OR irrelevant to 
problem OR student fails 
to correctly use scientific 
terms/vocabulary OR stu- 
dent fails to provide refer- 
ences from text or other 
reference materials. 



FIGURE 4.9 
Laboratory 
Report Rubric 






CRITERIA 



EXEMPLARY 



ADEQUATE 



RESUBMIT 





5 points 


3 points 


1 point 




Hypothesis/prediction 


Hypothesis/prediction is 


Hypothesis/prediction 


(/> 


is based on proper use 


based on background 


is not supported by 




and interpretation of 


information, but some 


background information 


III 


background information 


essential information may 


OR is not oiigned to the 




and is not just o guess. 


be missing or uncieor. 


experiment. 


O 


Hypothesis is cieoriy 


Hypothesis is oiigned to 




> 


stated and oiigns with 


experiment. 




X 


the experiment (the 
hypothesis con be 
supported or refuted by 
the data coiiected). 








5 points 


3 points 


1 point 


^ 9 


Aii moteriois and equip- 


Aii moteriois and 


List of moteriois/ 


ment needed ore iisted. 


equipment needed ore 


equipment is incompiete 


sis 

“ 5 a 
< 0 z 
S w 


iiiustrotions/diogroms 
ore provided to enhance 
ciority. Such graphics 
inciude iobeis giving 
names of reievont equip- 
ment/moteriois. 


listed. 


OR missing. 








7 points 


5 points 


2 points 


2 u 
z ^ 


Steps in procedure 


Steps in procedure ore 


Steps ore not sequenced 


UJ S 


ore numbered and 


sequenced iogicoiiy. 


iogicoiiy O/? insufficient 


^ 111 


sequenced iogicoiiy. 


Sufficient expiicit detoiis 


detoiis ore present to 


o 


Sufficient explicit detoiis 


ore inciuded to oiiow the 


oiiow the reader to 


s;g 


ore inciuded to oiiow the 


reader to dupiicote the 


dupiicote the experiment. 


2“ 


reader to dupiicote the 


experiment. 




111 


experiment. 








5 points 


3 points 


1 point 




Aii possibie reievont 


The most important 


important safety issues 


(/> 


safety issues ore iisted, 


and reievont safety 


ore not iisted O/? student 


z 

n 


and student describes 


issues ore iisted, and 


foiis to describe how 




how to avoid/minimize 


student describes how 


listed issues con be 




such issues. 


to avoid/minimize these 


avoided/minimized. 


< 

o 

LU 

D. 

> 

LU 

11 . 

< 

W 


There is iittie or no risk of 
safety or discomfort to the 
student. 


issues. However, student 
foiis to mention some 
minor safety issues that 
ore oiwoys reievont (os 
checking giosswore for 
cracks). Some further 
expionotions couid mini- 
mize discomfort to the 
student, but safety risk is 
minimoi. 


Safety risk is not 
appropriate. 




10 points 


7 points 


4 points 


< 


Student identifies controis, 


Student identifies controis. 


Student foiis to correctiy 




dependent, and inde- 


dependent, and indepen- 


identify controis or 


o 


pendent voriobies cor- 


dent voriobies correctiy. 


variables OR no units 


o 


rectiy. Data is dispioyed 


if measurements ore 


ore included 0/?not 


111 


grophicoiiy in on easy to 


token, correct units ore 


oil significant data is 




understand form, if meo- 


inciuded. Ail significant 


recorded. 


o 


surements ore token, cor- 


data is recorded. 




o 


rect units ore inciuded. 






111 


Aii significant data (data 
needed to soive the prob- 
iem) is recorded. 








FIGURE 4.9 (continued) 



CRITERIA 


EXEMPLARY 


ADEQUATE 


RESUBMIT 




7 points 


5 points 


2 points 


(/> 


Formulas are provided 


Appropriate tormulas are 


Formulas are inappropri- 


z 

o 


and used correctly 


used, but may not be writ- 


ate or not in use OR mea- 


w 


(e.g., Density = Mass 


ten out. All measurements 


surements lack units OR 




divided by Volume). 


contain proper units. Most 


greater than 2% error on 




All measurements 


calculations are per- 


calculations. 


o 


contain proper units. 


formed correctly, yielding 






All calculations are 


accurate answers (inac- 




o 


performed correctly, 
yielding accurate 
answers. 


curacy must not occur in 
greater than 2% of calcu- 
lations). 






15 points 


10 points 


5 points 


(/> 


In the conclusion, 


In the conclusion, student 


Conclusion does not 




student states if hypoth- 


states if hypothesis is 


refer to the hypothesis 


o 


esis is accepted or 


accepted or rejected. 


0/7 conclusion is not 




rejected. Any conclusion 


The conclusion, while 


supported by the data. 


z 

o 


is based upon the data 
collected and the ratio- 


supported by the data, 
may be incompletely 




u> 

o 

z 

o 

o 


nale for the conclusion is 
completely explained by 
referring to this data. The 
student’s rationale leads 
correctly to the accep- 
tance or rejection of the 
hypothesis. 


explained. 






7 points 


5 points 


2 points 




Graph form chosen 


Graph form chosen is 


Graph is missing OR 




is appropriate for the 


appropriate for the data. 


form is inappropriate 


2o 


data. All data points are 


All data points are accu- 


OR required titles/labels 


accurately plotted. Graph 


rately plotted. Graph 


are missing O/? intervals 


3 3: 


contains an overall title 


contains an overall title 


are not consistent OR 


o < 


and axes are labeled. 


and axes are labeled. 


measurements are not 


z “ 


Intervals are appropriate 


Intervals are consistent 


given. 


9 w 


and consistent and 


and measurements 




o 


measurements are given. 
Graph is easy to read and 
understand. 


given. Some use of inap- 
propriate intervals may 
reduce clarity, making 
graph hard to read. 






5 points 


3 points 


1 point 


z 

o 


Student uses the rubric to 


Student uses the rubric to 


Student fails to use 




rate his/her performance 


rate his/her performance 


rubric to rate his/her 


o 


and highlights areas 


and highlights areas of 


performance OR 


Ul 


of weakness. Student 


weakness. 


uses rubric, but fails 


IL 

Ul 


suggests appropriate 




to highlight areas of 


a 

;core Convert 


strategies for addressing 

. these weaknesses. 

tfij] 




weakness. 



60-75 A 46-59 B 34-45 C 23-33 D Below 22 Resubmit 




kNkLvz } N & Assessment 
Data and a/Va«in6 

Instructional 

MODIflCATIONS 




So far in this pubiication, we have expiored two quadrants of the 
Ciassroom Assessment Cycie. These are: 1) Ciarify iearning targets 
and 2) Gather evidence in a variety of ways. These two quadrants 
compose the haif of the Ciassroom Assessment Cycie that is most 
commoniy impiemented in ciassrooms. The remaining two quad- 
rants of the Cycie are iess frequentiy seen in ciassrooms. it seems that 
teachers sometimes stop moving through the Cycie after they coiiect 
assessment data. They faii to foiiow through and anaiyze the data in 
order to make inferences or draw conciusions about student iearning. 
Let's examine transcripts of two interviews with science teachers to 
see which of the two teachers is compieting the Cycie. Both teachers 
teach third grade. 



Interview One: Interviewer (I) and Mrs. Esposito (E) 

I: I see in your unit plan that you’ve included a diagnos- 
tic assessment on Day One. Can you tell me about this 
assessment? 

E: Yes. The students will start a KWL chart (What I Know, What 
I Want to Know, What I Learned) before we begin our 
study of plants. They’ll keep this in their notebooks and 
add to it every day. 

I: What is your goal for this activity? 

E: I want to activate the prior knowledge of the students 
and then encourage them to track their own learning. 

I: How will you know if you’ve met these goals? 

E: They’ll turn in their completed KWL charts at the end of 
the unit. 



i 



J 



CHAPTQ? FIVE 



I: How will your assessment data be used to make instruc- 
tional modifications? 

E: If some students turn in incomplete KWL charts at the end 
of this unit, I’ll conference with these students. 

Interview Two: Interviewer (I) and Mrs. Gregory (G) 

I: Tell me about this seminar you’re planning on scientific 
inquiry. 

G: The students will do some text reading and define terms 
(e.g., problem, hypothesis, control) before the seminar. 
Then, I’ll present a scientific problem to them about 
plants. For example, I might just refer to the plant we 
have in the classroom that looks like it’s dying and ask, 
“What can we do?” This will begin the class discussion 
on the scientific inquiry. 

I: What is your goal for this activity? 

G: Actually, I have two. I want to build student knowledge 
of steps involved in a scientific investigation, and I want 
them to practice their seminar skills (listening, speaking, 
critical thinking). 

I: How will you know if you’ve met these goals? 

G: I’ll take anecdotal notes during the seminar, using a 
matrix I’ve prepared. This will help me keep track of the 
seminar skills demonstrated by each student. To check 
their knowledge of scientific inquiry. I’ll give the students 
another problem and ask them to design an experiment 
(working in small groups) that will help solve the problem. 

I: How will your assessment data be used to make instruc- 
tional modifications? 

G: I’ll review my notes about the seminar and conference 
with students to give them feedback about their perfor- 
mances. In the individual conferences, we’ll brainstorm 
strategies for improving their performances. Students will 
present their ideas for the new experiment to the class 
and will receive feedback from me and from the other 
student groups. Based on the presentations, I will note 
areas of difficulty and plan either some learning center 
activities or some re-teaching strategies to use with small 
groups in order to address these. 



Both teachers are collecting assessment data. It is clear that Mrs. 
Gregory is utilizing the data to impact her instruction in a more timely 
manner than that employed by Mrs. Esposito. For example, if Mrs. 
Esposito only takes up her diagnostic KWL chart at the end of the 
unit, she is missing the opportunity to discover what her students knew 
about plants before beginning the unit. She is also failing to identify 



possible misconceptions the students hold. Without this knowledge, 
she cannot plan instruction that will effectively eradicate these mis- 
conceptions. Therefore, her "diagnostic" assessment is not actually 
being used to diagnose student learning. In fact, she is using this 
KWL chart more as a formative assessment. (She seems to be using 
the KWL more as a self-assessment tool for students; one that helps 
students understand what they've learned and what they still need 
to learn.) Looking at the KWL at the end of the unit means that Mrs. 
Esposito is not using assessment data to help her adjust instruction 
in this unit. The data she collects may help her structure future units, 
but the data arrives too late to help students learn the material in the 
current unit. So, no immediate instructional modifications are being 
made by Mrs. Esposito. 

Conversely, Mrs. Gregory is using her assessment data to continuously 
modify her instruction. She is collecting anecdotal data on seminar 
performances, reviewing these, and then helping students strategize 
ways to improve. She is also using the assessment of the student 
experiment presentations to differentiate instruction for those who 
are struggling. She plans to note areas of difficulty and then address 
these in learning centers or by re-teaching small groups of students. In 
this manner, Mrs. Gregory's assessment data has immediate impact 
upon instruction. Students receive timely feedback that will help 
them improve performance in the current unit. 

From these two scenarios, we see that it is not enough to collect the 
assessment data; the teacher must also: 

• Review the assessment data immediately after collection. 

• Interpret the data, formulate inferences and/or conclusions. 

• Provide feedback (verbal, written comments, rubric scores, 
etc.) to students relative to their performances. 

• Use the findings of the analysis to immediately adjust 
instruction. 

In order to proceed effectively through these steps, we need to 
examine what is meant by analysis and look at ways we can modify 
instruction to enhance individual student performance. 

Analysis 

The graphic in FIGURE 5.1 provides a perspective on the analysis of 
data. The organizing question that the teacher must ask before 
beginning this analysis is, "What did the student learn and how well 
did he/she learn it?" We have seen that it is important to ask this 
question frequently; perhaps even on a daily basis. In this manner, 
assessment data can have an immediate effect on instruction in 
that the teacher can intervene quickly to address individual student 
learning difficulties. Data analysis can provide teachers with insights 
about student learning in the short term. However, continuous data 
analysis can also be useful in the long term. As we examine FIGURE 5.1, 
let's work through an example of the long-term use of data analysis. 




FIGURE 5.1 

Data Analysis Process 



At the end of the grading period, Mr. Giiiey, a sixth-grade 
teacher, sat down to exannine the assessnnent data he had 
coiiected over the quarter. He was particuiariy interested in 
reviewing assessnnents fronn those students who appeared 
to be struggiing— those students who nnost often needed 
individuai attention fronn hinn in order to be successfui. 

He oiso iooked at students who hod iow overage grades. 
Surprisingiy, one such student was Luci. 

Before reviewing the assessments, Mr. Giiiey hod thought of 
Luci os one of his best students. She oiwoys seemed atten- 
tive and appeared to understand oii the concepts pre- 
sented. Looking at Luci’s assessments in the Adaptations of 
Animois unit reveoied the foiiowing: 

1) Diagnostic paragraph that asked students to define 
“adaptation,” give one exampie of an animai adapta- 
tion, and expiain how this adaptation aided the animai 
to survive: Luci did not attempt to define adaptation, but 
wrote, “Poiar bears— fur.” 

2) Reading comprehension sheet that inciuded fiii in the 
biank and short-answer question. This worksheet foi- 
iowed an assigned text reading on animai adapta- 
tions: Luci successfuiiy compieted the fiii-in-the-biank 
questions. On the short-answer questions, Luci oniy 
attempted two of the four questions. She wrote her 
answers as phrases, not as compiete sentences. Most of 
her answers were factuai in nature, even when the ques- 
tion asked for anaiysis. This caused her to receive a iow 
assessment on this assignment. 

3) Matching test on terms from the chapter: 

Luci made a 100. 

4) Animai Adaptation Project in which students were 
assigned a particuiar ciimatic zone and then asked to 
create an imaginary creature who couid successfuiiy 
survive in this zone. The project required that students 
draw their creature and then write a supporting para- 
graph identifying the adaptations and expiaining why 
these adaptations wouid make the creature success- 
fui: The rubric used for scoring this project is shown in 
Figure 5.2. The rubric shows that Luci provided a detaiied 
drawing of her creature and iabeied this iiiustration with 
arrows and words as “ciaws— catch food.” Her animai 
was cieariy weii adapted to survive in the tundra zone 
she was assigned. Luci’s paragraph was very sketchy. It 
contained a iist of adaptations with few expianations. 
Therefore, she earned very few points within the “para- 
graph” portion of the rubric. This negativeiy impacted 
her overaii performance on this project. 




GRAMMAR PARAGRAPH DRAWING DETAILS 



FIGURE 5.2 

Animal Adaptation Project Rubric— Luci's Scores 



CRITERIA I 



o 



EXCELLENT 



ADEQUATE 



RESUBMIT 



< </» 
O! 

a 



O z 

I- m 
u Z 

< s 

Q > 

< Z 



1^2 pt^ 


1 Pt. 


'"AHITti^ is sized to use most 
of the space on a standard 
piece of notebook paper 


Animal is sized to use at least 
% of the space on a standard 
piece of notebook paper 

^ 


12 pts. ( 




Animal is shown with more 
than 3 adaptations. Each 
adaptation is labeled with 
a name and a tunction. 

The function matches the 
adaptation (Ex: teeth are 
labeled tor fighting or eating 
tunctions, not for adapting 
to weather). 


Animal is shown with at 
least 3 adaptations. Each 
adaptation is labeled with 
a name and a tunction. 
The tunction matches the 
adaptation. 


8 pts. 


6 pts. 


Paragraph accurately 
explains why animal will 
need the adaptations for 
the assigned environment. 
At least 4 adaptations are 
explained. 


Paragraph accurately 
explains why animal will 
need at least 3 adaptations 
for the assigned 
environment. 


5 pts. 


3 pts. 


Paragraph contains no 
spelling, capitalization, or 
other grammatical errors. 
Student writes in complete 
sentences. 


Paragraph contains 
2-3 grammatical errors. 
Student writes in complete ^ 
sentences. ( 


4 pts. 


^3pt^ 


More than 3 adaptations 
match animal needs in the 
assigned environment. 


Afleast 3 adaptations 
match animal needs in the 
assigned environment. 



Opts. 



Animal takes up less than Y4 
of the space on a standard 
piece of notebook paper 



0-8 pts. 

Animal is shown with fewer 
than 3 adaptations OR 
adaptations are not labeled 
with names and function. 



0-5 pts. ^ ^A‘ 

Paragraph fails to 
accurately explain the 
animal’s need for at least 3 
adaptations. 

0-2 pts./ 

Paragraph contains more 

y? 

OR student tails to write ir 
complete sentence^ 



0-2 pts. 

Less than 3 adaptations 
match animal needs in the 
assigned environment. 






Score Conversion 

28-31^ A 23-27^8 18-22 ^ C 

Below 18 Student must resubmit assignment 



APPUCATION 






/Q. 












Before reading on, answer the following questions. Compare your answers to the 
ideas presented in the following section. 



1 . What patterns of student behavior did you discern? 

2. What inferences did you draw about Luci's learning and/or performance? 

3. Do you need more data? If so, what types of data would you like to collect? 

4. What would you suggest os a “next step” in order to improve Luci's learning 
and/or performance? 









After reviewing this assessment data, Mr. Giiiey began to 
look for patterns. One pattern that seemed particuiariy 
strong was Luci's performance on written assignments. 

It appeared that Luci rarely wrote in paragraphs. She 
appeared to use labels or phrases in lieu of complete sen- 
tences or organized, cohesive paragraphs. Another pattern 
that emerged was Luci's consistent high performance on 
selected response items. This behavior seemed more in line 
with Mr. Gilley's own perceptions of Luci's performance. 

Once he had noted patterns in Luci's performances, Mr. 
Gilley reflected on underlying causes for these patterns. 

He began to formulate inferences about her academic 
behaviors. It was obvious from the examination other 
selected response work that Luci was learning science 
concepts. It was also obvious that writing was a problem 
for Luci. Mr. Gilley tried to list possibilities for this problem: 

a) Luci has not had sufficient instruction in writing skiiis. 

b) Luci has had iittie experience in writing. 

c) Luci has a specific iearning disabiiity reiated to writing. 

In order to narrow these possibilities, Mr. Gilley decided he 
needed more information. He would not be able to test or 
verify his inferences without collecting additional data. 
First he consulted Luci's cumulative folder to see if she had 
ever been tested for specific learning disabilities. He found 
that she had been tested, but that no specific learning 
disabilities were found. He then talked with Luci's current 
language arts teacher, Mrs. Riley, about her performance 
in that class. Mrs. Riley reported that she, too, had noticed 
Luci's reluctance to attempt and poor performance on, 
written assignments. She told Mr. Gilley that she had begun 
to work closely with Luci, giving her individual attention 
and extra help in order to develop her writing skills. Finally, 

Mr. Gilley also conferenced with Luci and her parents and 
shared Luci's assessment results with them. This conference 
revealed that Luci felt unprepared to write and that she was 
reluctant to "show her ignorance" as she put it by writing. 
She had had some bad experiences in the past feeling 
ridiculed and belittled by some of the feedback she had 
received on her writing. Because she found writing to be 
risky to her self-esteem, Luci avoided it 

Through this data collection process, Mr. Gilley obtained 
sufficient information to draw conclusions about Luci's 
writing performance. He concluded that Luci's poor writing 
skills were the result of insufficient instruction and insufficient 
practice, rather than attributing this poor performance to a 
specific learning disability. Once he had identified the prob- 
lem through the data analysis process described here, 

Mr. Gilley was ready to plan instructional modifications to 
enhance Luci's academic performance in his science class. 




Instructional Modifications 

In the above data analysis example (Mr. Gilley and Luci), we saw that 
after the analysis process, Mr. Gilley's next step was to plan needed 
instructional modifications. By analyzing his students' (Luci's and all 
the other students') work and then designing instructional strategies 
to meet the needs of his students, Mr. Gilley demonstrated on exam- 
ple of differentiation in the classroom. 

Differentiation 

When teachers modify instructional plans to meet the needs of spe- 
cific students, they differentiate instruction. Differentiated instruction 
simply means "teachers reacting responsively to a learner's needs" 
(Tomlinson & Allan, 2000, p. 5). Teachers who differentiate instruction 
know that one size does not fit oil. They recognize that they teach a 
group of diverse learners with different learning styles, interests, and 
prior experiences. 

Almost oil teachers differentiate instruction to some extent. For exam- 
ple, when a lecturing teacher notices o puzzled expression on o stu- 
dent's face, she may pause, and then present the information again, 
but use o different manner of explanation this time. In this process, 
she is reacting to an individual student's needs and adjusting her 
instruction to fit that student's needs. In this chapter, we emphasize 
that teachers need to go beyond such stimulus-response reactions 
and think proactively about differentiation in order to moke differen- 
tiation o natural port of the classroom environment. 

In the past when we have urged teachers to differentiate in their 
classroom, we have heard o chorus of groans and muttered com- 
ments such os "Impossible!" and "Not enough time!" This led us 
to facilitate a discussion of differentiation to see if our definition 
matched that of the teachers. We found that it did not. Before we 
go further with a discussion of differentiation in this chapter, we need 
to clarify the term. We have already stated what differentiation IS 
(teachers responding to individual student needs). However, we will 
now explain what differentiation is not. 

Differentiation myths: 

1) Differentiation is equivaient to individuaiized instruction. 

"Individualized instruction" evokes images of students sitting with 
self-paced, self-checking booklets. In such classrooms, one envi- 
sions no peer interactions and few teacher/student interactions. 
Today, many home school students experience a new brand of 
individualized instruction— a one-on-one experience between 
the teacher and the student. We don't wont to re-institute the first 
description of individualized instruction, and we aren't able to do 
the second. In differentiated instruction, we ore not individualizing 
or tailoring instruction for each and every student. 



2) Differentiation is oniy important for iow-abiiity students. 

High-ability students need differentiation as much as low-ability 
students or they become bored, restless, and disruptive. 

3) Differentiation means that we can never have whoie-group 
activities. 

Everything in a differentiated classroom does not take place in 
small groups. Whole-group activities do occur. The teacher may 
demonstrate or model o skill or process— tell o story, shore infor- 
mation, etc. When we hold students accountable for the learning 
acquired through whole-group activities, we may do this in o vari- 
ety of ways. 

4) Differentiation requires abiiity grouping. 

Were you ever ability grouped? Didn't you figure it out, even 
though your group hod a non-judgmental name (e.g. the 
Redbirds)? Students will figure it out, too. Differentiated instruc- 
tion doesn't mean setting up the classroom to keep the low 
kids low and the high kids high— we don't wont any self-fulfilling 
prophecies. Rather, o differentiated classroom takes the "Army" 
approach: Be all that you con be. 

5) Differentiation means the “bright kids" have to do more. 

No, we don't expect the kids who catch on the fastest to do 
more- they may just do different things. They may be excused 
from some practice activities, os they've already "gotten" it— they 
don't need the practice. 

6) Differentiation means i have to incorporate choices for students in 
every assignment, activity, or assessment. 

Teachers should only differentiate instruction when there is o per- 
ceived need to do so. It is important to remember that teachers' 
classrooms aren't judged to be "differentiated" or "not differenti- 
ated" based on o single day or o single assignment. Differentiating 
means using o variety of instructional strategies and assessment 
methods os your students move through the curriculum. 

Returning to the example of Mr. Gilley's classroom, Mr. Gilley used 
the data analysis process to determine the needs of his students. 

We know he found that Luci needed extra practice and extra sup- 
port in writing skills. However, we don't hove the results of the other 
analyses he performed. Perhaps he found other students in the class 
who shared Luci's need for instructional modifications in writing. He 
certainly found students with other learning needs. By pinpointing 
the learning strengths and weaknesses of the students in his class, 

Mr. Gilley was able to plan instruction to meet the needs of oil his 
students. How might this be accomplished? One expert in differentia- 
tion, Carol Ann Tomlinson (1999), recommends planning for differen- 
tiation within three different dimensions of the classroom: Product, 
Process, and Content. The chart in FIGURE 5.3 provides definitions of 
these dimensions and gives examples of how teachers may differenti- 
ate in these areas. 



PRODUCTS PROCESS CONTENT 



FIGURE 5.3 

Dimensions of Differentotion 



DEFINITICN 


IDEAS FROM THE EXPERTS 


Facts, concepts, generalization, 
principles, attitudes, and skills 
related to the subject. Includes 
both what the teacher plans 
for students to learn and how 
the student gains access 
to the desired knowledge, 
understanding, and skills. 


Choosing text/novels/reoding assignments to address 
different reading levels. 

Present info one time os “break it down from fhe whole”; 
next time build it up from its ports. 

Use readings, videos, computer demos, audio 
recordings to present material. 

Use pre-assessments os journal entries, class discussions, 
quizzes to assess student readiness to learn. 

Provide o note-taking matrix to some students. 


How the learner comes to 
make sense of, undersfand, 
and “own” key facts, concepts, 
generalizations and skills. 
(Synonym for Activity) 


Encourage student conversations— 

hove them explain concepts in “kid language.” 

Soy, “This reminds me of...” to help students moke 
connections. 

Ask students to find patterns, categories, 
similarities/differences in data. 

Ask students to moke and justity judgments. 

Work with students to develop rubrics. 

Allow for varied working arrangements 
(work alone or in groups). 


Items the student can use to 
demonstrate what she has 
come to know, understand, 
and be able to do as the result 
of study (e.g., portfolio, test, 
project). 


Encourage students to express what they hove learned 
in various ways. 

Provide or encourage varied use of resources in 
preparing products. 

Provide product assignments at varying degrees 
of difficulty. 

Use o wide variety of assessment methods. 

Encourage culminating assessments such os 
“Senior Projects.” 



For example, Mr. Gilley knows that Luci and several other students 
in the class have poor writing skills. He could choose to differentiate 
in the "Process" dimension by providing "choice as to how students 
will express knowledge (write a paper, make a diagram, construct a 
model)." This would allow Luci and the other students to express their 
scientific knowledge in ways other than writing. However, if he wishes 
to enhance the writing skills of his students, Mr. Gilley may choose to 
differentiate in the "Products" dimension by allowing for varied work- 
ing arrangements (work alone or in groups). Here, he could team Luci 
or other students struggling with creating written products with more 
proficient peers or he could create sub-groups of struggling students 
within the classroom and spend more instructional time with them. 





APPMCATION 



"N 

Analyze your assessment data from the last grading period. 
Once you have identified student learning needs, list at least 
three ways you can differentiate within each of the three 
dimensions (Products, Processes, and Content). 

V y 



When teachers make instructional modifications by differentiating 
in one of these three areas (Content Products, Process), they ore 
responding to student needs. Before they con differentiate, they 
must first identify these needs through a data analysis process. In 
Chapters Three and Four, we explored different assessment methods 
teachers con use and different venues for providing feedback to stu- 
dents. In this chapter, we hove emphasized reviewing the assessment 
data and analyzing it to determine students' learning needs so that 
instructional modifications to meet those needs can be designed. 
With this chapter, we complete the Classroom Assessment Cycle. Our 
final chapter provides further examples, showing the complete imple- 
mentation of this cycle in science classes. At the end of this manual, 
you will also find on Appendix that discusses ways of communicating 
assessment results to parents and students. 






Iaapleaaentin& 
THE Cycle 

This manual has introduced the Classroom Assessment Cycle os a 
means of enhancing student achievement. The Cycle prescribes a 
process that teachers can use to align their classroom assessments 
with research-based best practices. By implementing all four quad- 
rants of the Classroom Assessment Cycle, teachers can: 

1) Clarify science learning targets for their students. 

2) Gather assessment evidence in a variety of ways and provide 
meaningful feedback to students on their academic progress 
in science. 

3) Analyze assessment data to determine the learning needs of 
the students in science classes. 

4) Modify science instruction to meet those learning needs. 

(See FIGURE 6.1.) 




FIGURE 6.1 
The Classroom 
Assessment 
Cycle 



In this last chapter of the manual, we provide examples of science 
lessons that demonstrate how teachers at different levels may imple- 
ment the Classroom Assessment Cycle. We think of these examples 
as snapshots from the classroom. They focus on only one day's les- 
son, rather than on a complete unit. The examples provide descrip- 
tions of previous activities, detailed plans for today's lesson, and 
a foreshadowing of tomorrow's science class. By studying these 
examples, teachers may obtain a clearer idea of how the Classroom 
Assessment Cycle works on a day-to-day basis in the science class- 
room. Teachers may then feel prepared to design lessons (and even- 
tually whole units) using this assessment approach. 

The first classroom snapshot is from Mr. Tighe's third-grade class. The 
chart in FIGURE 6.2 categorizes the classroom activities into particular 
quadrants of the Classroom Assessment Cycle. This chart is followed 
by two FIGURES (6.3 and 6.4) that provide more information about the 
assessment and instruction in this classroom. Finally, FIGURE 6.5 provides 
a summary of the learning targets, assessments, feedback, data anal- 
ysis, and instructional modifications found in this third-grade lesson. 



FIGURE 6.2 

Elementary Level Implementation of the Classroom Assessment Cycle: Third Grade, Mr. Tighe's Class 





0) Q) 
^0.0. 
® 0) I 

E 

c > 

CD <D ~ 
"O ^ > 

« O) « 

C C o 

o z o 
"O tl 

“ O) f- 

e o 
-Q > ° 

« O X 

c c o 

o 73 « 

CD ° <D 
SI U)TS 
C3) C D 
i— ■> <n 
^ 0) 
^ o£ 



■5; Q- ® 

= E a 

0 o £ 

0) Q. o 
■ti — X 
o> := 0 

® 5 0) 

1 ® E 

o 

« « « 
o P-- D) 

E|g>i 

|o,o.| 

^ 

« ^ 12 j- 

O ^ CD 
■5 O 12 73 
E S C = 

O) w '35 0 
C CO -t 
■5 CD 3 O 

= o u => 

^ D - E 

o E 73 

C O <0 CO 

oo'£0 



< 

o 

(/) 

o 



o 

< 



c 

<D 

■D 

_2 

"co 

.C O 



(/) 

< 
z 
< 

I- i .bi i: 



g 0 0 
</) "D 0 
<0 D t 

UJ — » 

cfl o 73 

"3 ^ ^ 
O o 

0) 

^ n 
C3) ^ 

i= CD 



C O 
F C 
CD 






CD 

5 

to 

C 

g 

Q. 

CD 

o 

c 

o 

o 



CD -:r 



5 ^ 5:^ E 



lu 

</) 

< 



73 

CD 

to 

D 

CD 

C3) 



D) 

c 

> 

O 



to o 

D "- 



O) 

C CD 

■> £ 

1| 
® i 
E ° 

oS> 



73 

CD 



CD 

SI 

U) 



o 

73 

c 



g CD .— ii 



o c 



C 
D 

a 
o 

0 ) 

O) r 
3 o 

OQ Q. 

:□□□□□ 



■iC' to 



D) — 
c ■“ 



C3) £ O 
C J 



O ^ 
C 

O o 
CL O 



■D 

D 

O 

13 

O C3) 
O O 

CQ LL 



□ □ □ □ 



0) Q. 

73 O 
O CD 
D) o 
£ 

CO "O 

£3 

-t- CO O 

^ CO CO 

P c P> 
■5 O.E 
^ 73 73 
3 ^ ^ 

CO CO O 

P = *co 



C = 
0 O 
F C 
CO F ^ 

E §0 

CO ^ 

1 



0 ) o 

73 — 



c -iii. 



o Q> 
CD £ 



CD 



O 



D . 



CD 



to "P 

< I C 



CD 

§i 

CD D) 

iE o 



2> I 6 

o ? o> 

^ CO 



P 0) 

-- q: 

CM 



P> £ 



« 2“ 

® E o 

o ~ C 

> c o 
o p 7= ^- 

CD P) P o 

^ oz o- 



?■ 5 □ 



□ □ 



■5^§ = 

0 CO CO 

x: 0 = -p 

0 ) 2 3 73 

5 CD .E *” 

c-pe 

3 = ® P 
^ CD O 73 

o£ ^ £ 
c o 



o P 



73 c 

° « 

3 C 
O CD 
73 > 

D CD 

^ 

:§ P 

CO . C 
0 CO 0 

CT I I 

O C P 
« P) '| 
O O 



c 
D g 

o D 

3 C 

c a 

o X 

o CD 
O £ 



CM 

CO — CM 
CO — 

-od 
,P)P§ 
CD . 

> to = 

•E c: o 

£:f § 

D P O 

UJi 

3 Q. O 

P O 

p « 

,9 £ CD 

U D q: 



□ □ 




o 

to 

LO 



< 

o 

o 



uu "D 
O. ig 
in w 



-5 I 

O ^ : 

C O : 
crt E 
D O 
CD ® 

c/> O 
CD 

CD O 
D 

3 D 

□ 



0 ) o> 

<o C 

c 

D 

= O 
> w 
CD D 

5 o 

ii 

£ o . 

lgl 



D) 

l| 

^ ^ to 
D 2 O) 
^ o £ 

°lf 

^ CD O) 
O O ~ 



to 



^ 0 )c 
D E o 

^£f 

□ 



^ o 

of 

-O ® 

c £ 

to 

= ^ 

»- O) 

3 C 

O 'c 
0) O </i 
^ to to 
^ D O 

^ 9 > o 

□ 



CD 

D 5 

to "o 
O o 

CD O _ 
n O 



CD 

O 

C ^ 

.9>| 
o > 

CO -o 
■= C 



D) 

c 



O p 



iE ^ 
O 
CD 

a 

V3 



TS 

0) 2 
® O) 



■D 

C 

«5 

£CO 

« c 

C p 

2>8 

O 3 

O LU 



o 

"D 

P 

> 

D ® 
w C 
« O 
0) . 

CD ^ 

c '5 
« CT 
CD c 



5 -!2 E E - i= 



= O 

n E 

Pc 

■<d-9> 

c o 



bii, CD 



O 

CD 

!q 

o 

D) 

c 

■> 



>- O) 
CO c 



<D S 



O 

u 

O) 

c 

c 

o 

<« 

o 

Qi 



a o 



C M_ 

■> o 

'— tn 

cP 

0 “w 

C -c 

g>o 

1 s 

= D 
« ^ 
O O 



o 

(A 

o 



D) 



"c c~ 

^ O) > 
C ^ 



Eo> 

^■i 

CD = 

CD ^ 
^ rii 



CD 

V3 

n 

O 

N_X 

JO 

V> 

to 

to 







CLARIFY GATHER EVIDENCE ANALYZE MODIFY 

LEARNING TARGETS IN A VARIETY OF WAYS ASSESSMENT DATA INSTRUCTION 
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FIGURE 6.5 Summary of Classroom Assessment Elements Found in Mr.Tighe's Third-Grade Science Class 
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In Mr. Tighe's classroom, the primary formative feedback mechanism 
was verbal feedback. The following middle school example from Mrs. 
Nagura's seventh-grade science class (See FIGURE 6.6) also utilizes 
verbal feedback. However, In this example, written feedback In the 
form of quiz grades, comments on science journals, and rubrics Is also 
present. Both examples Incorporate diagnostic as well as formative 
assessments, use a variety of assessments and teaching methods, 
and emphasize the continuous analysis of assessment data to Inform/ 
modify Instruction. The middle school example, like the elementary 
one. Is followed by a summary chart of the learning targets, assess- 
ments, feedback, data analysis, and Instructional modifications used 
(See FIGURE 6.11). 



FIGURE 6.6 

Middle School Level Implementation of the Classroom Assessment Cycle: Seventh Grade, Mrs. Nagura's Class. 
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FIGURE 6.8 

Creating a Baby Creature 



Names of Group Members: 

1) Each student group will receive a packet of chromosomes. The yellow chromosomes 
belong to Mom Creature and the block chromosomes belong to Dad Creature. 

2) Notice that the chromosomes appear to be of different lengths and that each 
chromosome is labeled with o letter. 

3) Lay oil the chromosomes face down (so the letter is hidden) on the table. Sort them 
into some size groups (each some size group should contain four chromosomes 
(two yellow and two block). 

4) Choose one yellow chromosome and one black chromosome from each same 
size group. Leave the chosen chromosomes on the table. Put the remainder of the 
chromosomes back in the packet and put these aside. 

5) Chromosomal analysis on the Creatures has shown that each chromosome con- 
tains only one gene. This one gene codes for o particular trait. The attached chart 
shows you the genetic code (genotype) for each trait beside the physical descrip- 
tion (phenotype) of this trait. You will use this code to create your Creature offspring: 



GENOTYPE 


PHENOTYPE 


MATERIALS USED 
TO CREATE THIS TRAIT 


AA 


1 antenna 


1 small noil 


Ao 


2 antenna 


2 small noils 


oo 


No antenna 




BB 


1 yellow hump on back 


1 yellow mini-morshmallow 


Bb 


2 yellow humps on bock 


2 yellow mini-marshmollows 


bb 


3 yellow humps on bock 


3 yellow mini-morshmallows 


CC 


Green nose 


1 green mini-marshmallow 


Cc 


Yellow nose 


1 yellow mini-morshmallow 


CC 


Pink nose 


1 pink mini-marshmollow 


DD or Dd 


Curly toil 


1 pipe cleaner, curled 


dd 


Straight toil 


1 pipe cleaner, not curled 


EEorEe 


2 eyes 


2 thumbtacks 


ee 


3 eyes 


3 thumbtacks 


FF or Ff 


4 blue legs 


4 blue push pins 


ff 


4 red legs 


4 red push pins 


GG or Gg 


3 body segments 


3 large marshmallows 


gg 


2 body segments 


2 large marshmallows 




6) Turn your mixed black and yellow chromosomes over and write down the codes 
you find here. 

7) Use the chart above to convert the genotype (the two letters) into the pheno- 
type (what the baby will look like). Gather the materials you need and assemble 
your baby Creature. You may use some toothpicks to hold your baby together. 
The toothpicks should be hidden, not visible when your baby Creature is finished. 
(Think of the toothpicks as bones.) 

8) Write the names of the two members of your group on a piece of paper. Put this 
piece of paper in the Creature playpen (at the front of the room on the teacher's 
desk) and place your baby Creature on this paper. This piece of paper serves to 
identify your baby Creature. 



FIGURE 6.9 

Discussion Sheet for Creating a Baby Creature 



1 . How many baby Creatures were constructed in today's class? 

2. How many were exactly alike (identical twins)? 

3. How many were different (in at least one way) from all the others? 

4. If there are 21 different genotypes possible for the baby Creature, are there the 
same number, more, or fewer phenotypes that are possible? Explain your answer. 

5. You started this activity with 28 chromosomes (14 from Mom and 14 from Dad). 
You discarded 14 chromosomes (7 from Mom and 7 from Dad). Look at the chro- 
mosomes you discarded. If you had made your baby Creature using these chro- 
mosomes, would it have looked the same as the one you actually made? Explain 
your answer. 

6. Why was it necessary to reduce the number of chromosomes (from the original 
28 chromosomes to only 14) before creating your baby Creature? Explain. (In 
genetics, this "reducing" process is known as meiosis.) 

7. Select the A (A or a) yellow chromosomes you used to create your baby 
Creature. Tape the yellow B (B or b) to the bottom of this chromosome. Do the 
same to your A (A or a) and B (B or b) black chromosomes. The A and G chro- 
mosomes are now linked together. How might this affect the genetic diversity? 
(Increase it? Decrease it? Why?) 



FIGURE 6.10 

Linked and Unlinked Genes 



Mom’s Genes on Chromosomes 







— 










A 




a 




B 




b 



Mom’s Possible Genotypes 



Dad’s Genes on Chromosomes 




Dad’s’s Possible Genotypes 



AB 

Ab 

aB 

ab 



AB 

Ab 

aB 

ab 



PUNNET SQUARE OF 
DIHYBRID CROSS- 


DAD’S POSSIBLE 
GENOTYPES 


UNLINKED GENES 


AB 


Ab 


aB 


ab 


(/> 
lAJ UJ 


AB 


1) AABB 


2 ) AABb 


3 ) AaBB 


4 ) AaBb 


w J t 
" tt > 

^ M ^ 


Ab 


2 ) AABb 


5 ) AAbb 


4 ) AaBb 


6 ) Aabb 


O </} o 

^ S 


aB 


3 ) AaBB 


4 ) AaBb 


7 ) aaBB 


8 ) aaBb 


A 111 

O 


ab 


4 ) AaBb 


6 ) Aabb 


8 ) aaBb 


9 ) aabb 



Offspring's genotypes are shown in shaded squares. Numbers indicate different 
phenotypes shown beiow. if same number appears beside two genotypes, they 
code for the same phenotype. 

9 Possible Phenotypes for Baby Creature 

1) Baby's genotype is AABB, so baby has 1 antenna, 1 yeiiow hump 

2) Baby's genotype is AABb, so baby has 1 antenna, 2 yeiiow humps 

3) Baby's genotype is AaBB, so baby has 2 antenna, 1 yeiiow hump 

4) Baby's genotype is AaBb, so baby has 2 antenna, 2 yeiiow humps 

5) Baby's genotype is AAbb, so baby has 1 antenna, 3 yeiiow humps 

6) Baby's genotype is Aabb, so baby has 2 antenna, 3 yeiiow humps 

7) Baby's genotype is aaBB, so baby has no antenna, 1 yeiiow hump 

8) Baby's genotype is aaBb, so baby has no antenna, 2 yeiiow humps 

9) Baby's genotype is aabb, so baby has no antenna, 3 yeiiow humps 





FIGURE 6.10 

Linked and Unlinked Genes (continued) 



Mom’s Genes on Chromosomes Dad’s Genes on Chromosomes 





Mom’s Possible Genotypes 

AB 

ab 



Dad’s’s Possible Genotypes 

AB 

ab 



PUNNET SQUARE OF 
DIHYBRID CROSS— 


DAD’S POSSIBLE 
GENOTYPES 


LINKED GENES 




AB 


ab 


MOM’S POSSIBLE 


AB 


1) AABB 


2) AaBb 


GENOTYPES 


ab 


2) AaBb 


3) aabb 



Offspring's genotypes are shown in shaded squares. Numbers indicate different 
phenotypes shown beiow. if same number appears beside two genotypes, they 
code for the same phenotype. 

3 Possible Phenotypes for Baby Creature 

1) Baby's genotype is AABB, so baby has 1 antenna, 1 yeiiow hump 

2) Baby's genotype is AaBb, so baby has 2 antenna, 2 yeiiow humps 
2) Baby's genotype is aabb, so baby has no antenna, 3 yeiiow humps 





FIGURE 6.11 

Summary of Classroom Assessment Elements Found in Mrs. Noguro's Seventh-Grade Science Class 
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FIGURE 6.12 

High School Level Implementation of the Classroom Assessment Cycle: Tenth Grade, Mrs. Green's Class 
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FIGURE 6.13 

Problem Presentation: Hospital Memo 



Inter- Office Memorandum 



To: 



Dr. Kay Emko 



From: 



Hospital Administrator 
Dr. Jim Reed 



Date: 

Re: 



Chief Executive Office 
Nov. 14, 2005 



Smith conjoined twins 



As you know, Dr. M.E. Seay has contacted this office about the possibility of 
performing separation surgery on conjoined twins. These twins, born in Romania, 
were adopted this month by Wilbur and Darlene Smith. I am forwarding to you 
an anatomical schematic, provided by Dr. Seay, which shows the body structures 
shared by the twins. Please assemble a planning team, which will include medical 
staff, accountants, legal experts, hospital facility (physical plant) representative, and 
counselors to consider the feasibility of the hospital pursuing this undertaking. 

The hospital board will meet on Nov. 23, 2005, at 9:00 a.m. I would like for your 
team to present its recommendations at this meeting. 

The attached anatomical schematic shows the following: 

• The twins are two children above the waist and one below 

• 1 rib cage 

• 1 liver 

• 2 gall bladders 

• 1 large intestine 

• 1 small intestine 

• 1 pair of ovaries 

• 1 pelvis 

• 1 vagina 

• 1 urethra 

• 2 spinal cords that fuse at the base 

• 3 or 4 lungs 

• 2 hearts with a common bloodstream 

• 2 stomachs 

• 1 right kidney 

• 2 left kidneys 

• 1 uterus 

• 1 bladder 



FIGURE 6.14 

Rubric for Conjoined Twins Oral Presentation by a Student Group 
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EXCELLENT 


ADEQUATE 


POOR 


Relaxed, self-confident, 
and appropriately dressed 
for purpose or audience; 
builds trust and holds 
attention by direct eye 
contact with oil ports of 
the audience; fluctuations 
in volume and inflection 
help to maintain audience 
interest and emphasize key 
points. 


Quick recovery from 
minor mistakes, but some 
tension or indifference 
apparent; appropriately 
dressed; occasional but 
unsustoined use of eye 
contact with audience; 
uneven volume; variation 
in inflection is present, 
but not consistently 
implemented. 


Nervous tension obvious; 
inappropriately dressed; 
no effort to moke eye 
contact with audience; 
low volume and/or 
monotonous tone causes 
audience to disengage. 


Effectively captures 
interest/attention of 
the audience; lays out 
plan and sequence of 
presentation; informs 
audience of group role. 


Lays out plan and 
sequence of the 
presentation, but no 
attention-getting “hook” is 
present, informs audience 
of group role. 


Introduction is missing- 
no plan for the presenta- 
tion is given. Group role not 
identified. 


Shares the group’s problem 
statement, explains why 
the group chose this 
statement; proposes o 
solution to the problem, 
explains rationale for 
the solution, justifies why 
solution is sound and 
workable. 


Shares problem statement; 
proposes o solution; 
justifies the solution. 


Problem statement is 
missing; group proposes o 
solution but foils to justify 
why it is o sound and 
workable solution. 


Understood problem well 
enough to answer ques- 
tions posed by audience; 
provided quality answers 
to question— answers were 
based in facts, clearly 
related to question asked, 
and provided audience 
with needed information. 


Attempted to answer 
questions, but answers 
were of less quality— 
occasionally facts 
were misrepresented or 
inaccurate. 


Unable to provide answers 
to audience questions. 


Each member of team 
provides on overview 
of his/her contributions 
to the solution of the 
problem; each team 
member speaks during 
the presentation; oil team 
members field questions at 
the end from the audience. 


Each team member speaks 
during the presentation; 
presentation provides 
evidence that most team 
members contributed to 
the solution. 


Not oil team members 
speak during the presenta- 
tion; contributions of indi- 
vidual team members not 
addressed. 


Solution aligns with prob- 
lem statement (actually 
solves the problem pre- 
sented); solution addresses 
oil constraints listed in orig- 
inal problem statement; 
solution is authentic to 
group’s role; solution hon- 
ors deadline and budget; 
solution is feasible within 
the time frame given in the 
problem and is completely 
supported by research. 


Solution aligns with 
problem statement; 
solution addresses most of 
the constraints; solution is 
authentic to group’s role; 
solution honors deadline 
and budget; solution is 
partially supported by 
research. 


Solution foils to align with 
problem statement; foils 
to address majority of 
constraints; solution is 
not authentic to group’s 
role; solution foils to honor 
either the given deadline 
or budget; research 
support for the solution is 
missing. 




FIGURE 6.15 

Directions for Justification 



1) From all the solutions proposed today, select the two "best" solutions. Briefly describe 
these solutions. 

2) Construct a "pro" and "con" chart for each of these two proposed solutions. 

3) List positive and negative consequences of Implementing each solution. 

4) Write a paragraph, stating the one "best" solution and explain your rationale for 
choosing this one over the other one. 



5) Use the following rubric to maximize your performance on this justification task. 



CRITERIA 


GOOD 


ADEQUATE 


POOR 


SOLUTION 

SELECTION 


Solutions are selected 
from those presented 
today; solution 
descriptions are 
accurate, complete, and 
clear. 


Solutions are selected 
from those presented 
today, solution 
descriptions are 
accurate, but may be 
incomplete or lacking in 
clarity. 


Solutions are selected 
from those presented 
today, but descriptions 
are missing or hard to 
understand (insufficient 
detail is given to check 
accuracy of solution 
description). 


PRO/CON 

CHART 


At least four “pro” and 
four “con” statements 
are listed for each 
of the two proposed 
solutions: “pros” and 
“cons” accurately reflect 
implementation issues 


At least three accurate 
“pro” and three accurate 
“con” statements are 
listed for each of the two 
proposed solutions.. 


Less than three accurate 
“pro” and three accurate 
“con” statements are 
listed for each of the two 
proposed solutions. 


CONSE- 

QUENCES 


At least two positive and 
two negative outcomes 
(consequences) of 
implementing each of 
the two solutions is listed 
and explained. 


At least two positive and 
two negative outcomes 
of implementing each of 
the two solutions is listed. 


Less than two positive and 
two negative outcomes 
of implementing each of 
the two solutions is listed. 


JUSTIFICATION 

PARAGRAPH 


Paragraph clearly 
identifies and explains 
the “best” solution; 
student provides at least 
three viable reasons for 
choosing this solution 
over the other. Paragraph 
contains no grammatical 
errors. 


Paragraph clearly 
identifies and explains 
the “best” solution; 
student provides at least 
two viable reasons for 
choosing this solution 
over the other. Paragraph 
contains no more than 
two grammatical errors. 


Paragraph identifies the 
“best” solution, but fails to 
explain it. Less than two 
viable reasons are given 
for choosing this solution 
over the other. Paragraph 
contains more than two 
grammatical errors. 




FIGURE 6.16 

Summary of Classroom Assessment Elements Found in Mrs. Green's Tenth-Grade Biology Class 
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The examples in Chapter 6 provided insights into ways teachers 
implement the Classroom Assessment Cycle to enhance student per- 
formance in science. The focus was on formative assessment (assess- 
ments that provide feedback to students while they ore learning.) 
Therefore, the classroom snapshots did not address grading issues. For 
an overview of productive grading practices, you may wish to read 
the Appendix below. 

Appendix: Grading and Reporting Student 
Performance in Science 

In Chapter Five, we examined methods to use for analyzing assess- 
ment data and then various means for modifying instruction based 
upon such analyses. In this chapter, we will concentrate on ways 
to provide meaningful performance reports to both students and 
parents. In order to offer such meaningful reports, we must first 
review the three purposes of assessment: Diagnostic, Formative, and 
Summative. 

These three terms were first introduced in Chapter Four. The definitions 
for the terms were: 

Diagnostic Assessments. Used to ascertain the current level of 
understanding that students possess about a concept prior to 
instruction. 

Formative Assessments: Used to provide feedback to students 
os they progress toward a learning goal, during the learning 
process. 

Summative Assessments: Used to rote the proficiency of students 
relative to o particular skill or skill set at the end of instruction. 

Diagnostic assessments ore primarily used by teachers to aid in plan- 
ning meaningful instruction. While the results of such assessments 
may be shared with students or parents, it is usually the cose that 
teachers alone review these results. Students, however, ore the pri- 
mary audience for reports of formative assessments, information that 
is often communicated to them through the use of verbal feedback, 
written feedback or rubrics. Through such feedback mechanisms, 
students come to understand their strengths and weaknesses. Of 
course, the results of such formative assessments may also be shared 
with parents. Summative assessment reports, those coming at the 
end of a grading period, are usually more formal than formative 
ones, and they ore distributed in o format usually chosen by the dis- 
trict or county educational agency. This agency may also prescribe 
the frequency of such reporting (os every three, six, and nine weeks, 
etc.). Summative grades are those grades that we find on students' 
report cords. 

In this appendix, we shift our focus from formative feedback or "scor- 
ing" to "grading" (summative feedback). We will consider how to 
enhance the reliability of summative grades found on student report 
cards, and we will examine methods of calculating such grades. 
Finally, we will look at a grade book format that supports productive 
grading plans. 




Grade Reliability 

Grades are reliable when they give an accurate picture of student 
achievement relative to a learning target or standard. Brookhart 
(2004, p.lO) describes this os: 

The grade students get for an assignment should be the 
closest possible estimate you can make about their real 
achievement on whatever the assignment is supposed to 
measure. It shouldn't be a fluke— a low score underestimat- 
ing achievement because a student was ill, or a high score 
overestimating achievement because the student made a 
lucky guess studying only the one question you happened to 
pick for a test. It shouldn't be an accident of which teacher 
grades the paper— a "hard grader" or an "easy grader." 

The grade should indicate the level at which students would 
generally register on any assignments designed to indicate 
achievement of the same learning goals. 

In this quote, Brookhart emphasizes two important components of 
grades: accuracy and consistency. Like other assessment experts, 
(O'Connor, 1999 & 2002; Morzono, 2000; Stiggins, 1993; Gusky, 2004), 
Brookhart insists that grades should be a measure of student achieve- 
ment relative to some standard. If we accept this grading definition, 
then grades should not include such non-ocodemic factors os atten- 
dance, conduct, cooperation, ability, or attitude. O'Connor (1999) 
suggests that such non-academic factors be recorded and reported 
separately. 

In addition to basing summotive grades on academic performance, 
four other factors highly impact grading accuracy and consistency. 
These are: 

1) Which assignments are "counted." 

2) How much these assignments count (weighting). 

3) The statistical method used to calculate final grades. 

4) The use of zeros. 

What Counts? 

To decide which grades should "count," (that is, which assignment 
grades will be used to create the summotive, report card grade), 
we must first return to the three purposes of assessment: diagnostic, 
formative, and summotive. Should grades on diagnostic assignments 
be used to figure final grades? Most teachers would answer "no" to 
this question. Diagnostic assignments are given prior to any instruc- 
tion. They ore simply the measure of what o student already knows 
about a concept or subject. The purpose of diagnostic assessments 
is to plan appropriate teaching. We use diagnostics to prevent over- 
dwelling on concepts that students already understand and to focus 
our instruction on possible misconceptions and/or perceived weak- 
nesses in student understanding. Diagnostic assessment results ore 
not usually used in calculating summotive grades. 

The use of formative scores is more problematic, however. Even 
though, by definition, formative scores ore taken while students are 
still form/ng their understandings of concepts, teachers routinely 
record these scores as grades and then use them to calculate over- 
all, final grades for the report card. Whether this results in accurate. 



consistent grades generally depends on where a particular student 
is located on the learning curve when the grade is assigned. If the 
student has hod time to understand the concept presented and has 
been afforded sufficient practice time, then the grade may accu- 
rately reflect student achievement. The reverse may be true, however, 
for students lower on the learning curve! 

Ken O'Connor dramatically illustrates the problematic nature of 
using formative assessments in the chart below. (See FIGURE A.l.) In 
this graphic, we find the proficiency levels of students learning to 
pack parachutes measured over time. Student A begins with above- 
average proficiency, gains in proficiency, and then decreases in 
proficiency at the end of the recorded time period. Student B shows 
a lack of consistency in performance, with proficiency rising and fall- 
ing many times throughout the recording period. Student C began 
with very low proficiency, but has gradually increased in proficiency 
throughout the recording period. The question arises: At the end of 
the recording period, whom do you wont to pack YOUR parachute? 

FIGURE A.l 

Parachute Packers 



Student A 



Student B 
Student C 




Tiaae 



O'Connor, K. (2002). How to grade for learning. Thousand Oaks, CA: Corwin Press. 
p.33. Figure intro. 11 

When asked this question, most participants in O'Connor's workshops 
will answer that they would like Student 0 to pack their parachute. 
Student 0 has gradually improved in performance and has the high- 
est proficiency rating on the day of the jump. O'Connor then asks 
teachers to speculate about the students' summative grades. Which 
student would have the highest grade point overage at the end of 
the recording period? Most participants agree that Student A would 
have a higher overage, since Student A has more data appearing 
over the "overage proficiency" line on the graph. 

This "Parachute Packing" graphic, then, illustrates the problematic 
nature of using formative grades to calculate ending grades. Should 
Student 0 have a lower average than Student A, simply because he 
took longer to learn? Should oil the data points on the graph (some 
taken while students were still learning, i.e., formative grades) be used 



to figure the final, overall grade? O'Connor emphasizes that if we sim- 
ply want to know if the students can pack a parachute, the use of all 
data points is unnecessary and actually unfair, particularly to Student 
C. O'Connor further stipulates, "I believe formative assessment has no 
place in determining report card grades" (O'Connor, personal com- 
munication, October 3, 2005). 

How Much Does the Assignment Count 
(Weighting)? 

It is not uncommon for teachers to weight assignments differently. By 
"weighting," teachers assign relative importance to assignments. For 
example, a teacher may count homework as only 10% of the final 
grade, but the final exam may count 50%. Such differential weighting 
can have a huge impact on the final grade. Let's look at a research 
study conducted by Marzano (2000). 

In this study, Marzano asked two teachers who were team-teaching 
the same set of students to figure final grades for their 26 students. 
Both teachers used the same set of recorded grades and both 
worked independently to calculate ending grades. When the grades 
for the 26 students were compared, Marzano (2000, p. 5) found: 

• One student was assigned grades that differed by three letter 
grades (e.g., A vs. D). 

• Two students received grades that differed by two letter 
grades (e.g., A vs. 0). 

• Eight students' grades differed by one letter grade 
(e.g., A vs. B) 

• Fifteen students received the same grade from both teachers. 

• In all, there was agreement on only 15 of 26 grades or 57.7% of 
the grades. 

Marzano found that the differences in grading arose from the 
weighting of the assignments. One perhaps counted performance 
assessments as more important than homework assignments, or one 
weighted the chapter test more than the other. 

This research study illustrates how weighting of assignments can 
impact summative grades. The same data set of assignments can 
result in widely divergent ending grades, depending on the weight- 
ing scheme used. To ensure valid grades, the weighting scheme 
should be one that supports accuracy and consistency and one that 
is closely aligned with academic standards. 

Statistical Methods Used 

Throughout recent grading history, teachers have tended to average 
all grades in order to get final grades. "Averaging" uses the statistical 
method of finding the mean. Flowever, there are two other statistical 
methods that could be used in calculating final grades. These are 
the mode and the median. Let's take a set of scores for one student 
and compare the final grade obtained from using all three statistical 
methods. Flere are Taquisha's scores for the third grading period in 
science: 93, 94, 93, 67, 98, 91, 90. Using the mean (or "averaging") to 
figure Taquisha's grade, we would add them all together and divide 
by seven (the number of grades). By doing this, we would obtain an 



average of 89.4 for Taquisha. On o typical grading scale of 80-89% = 
B, Taquisha would have a very high B. 

Using the statistical method of the mode means that we simply look 
for the grade that occurs most often. For Taquisha, this grade would 
be 93, since she earned this grade twice. Therefore, if we use the 
mode to figure Taquisha's final grade, she would get a 93 (low A) on 
her report card. 

Using the median is the third choice we have. To find the median, 
we must first put Taquisha's grades in numerical order, from highest 
to lowest. This gives us: 98, 94, 93, 93, 91, 90, 67. The second step is to 
look for the grade "in the middle." Since there ore 7 grades, we look 
for the grade that would hove 3 grades above it and 3 below. (If 
Taquisha hod hod on even number of grades, we would simply over- 
age the two in the middle to find the median.) This "middle" grade 
is 93. Therefore, if we use the median to calculate Taquisha's final 
grade, she would again receive a 93 (low A). 

Which of these three statistical methods leads to the most mean- 
ingful (or appropriate) summary of Taquisha's work? If we look at 
all of Taquisha's grades, on a grading scale where an A = 90-100%, 
Taquisha only has one grade that is not in the A range (67). This one 
grade of 67 results in dropping Taquisha's final grade from an A to a 
B when using strict averaging (the mean) to figure Taquisha's grade. 
The other two statistical methods both calculate A's for Taquisha. 

Teachers must use their professional judgment to decide how to cal- 
culate final grades. Informing this judgment should be the question, 
"Which method results in a grade that most closely aligns with the 
student's actual academic performance?" If an outlying score (one 
significantly different from most of the others) appears to unduly influ- 
ence the final grade, the teacher may wish to examine other statisti- 
cal means of calculating final grades. Teachers may also want to use 
caution in implementing the mode as a means of figuring the final 
grade. The mode is fairly unstable as an indicator of student achieve- 
ment. Consider the following set of grades for Kenon: 92, 67, 94, 96, 

99, 67, 91. Using the mode in Kenon's case would result in a grade of 
67, as opposed to a mean of 86.6 or a median of 92. Actually, Kenon's 
two grades of 67 appear to be outliers (unusual grades for Kenon). 
Therefore, the strict use of the mode con result in grades that do not 
accurately reflect student achievement relative to standards. 

The Use of Zeros 

Our working definition of grade reliability states that grades accu- 
rately depict student academic performance relative to a learning 
target or standard. A grade of zero rarely provides such an accurate 
reflection of what a student has learned. Zeros are most commonly 
used to denote absent or missing assignments. However, when such 
zero scores are used to calculate summative, report card grades, 
their effects may be greatly magnified and therefore, zeros may neg- 
atively impact grade reliability. Let's look at a set of scores for Jason. 

In science class this term, Jason has earned the following set of marks: 
90, 92, 88, 90, 91, 89, and 91. Simple averaging of these scores gives 
Jason an overall final grade of 90.1 . Adding in a single zero to this set 
of scores, however, drops this average to 78.9. Which final score more 
accurately reflects Jason's performance in science this term? 



Teachers often defend the use of zeros by stating that they cannot give 
students credit for missing work. Whiie this is true, many faii to assess the 
huge negative impact that even one zero can have on a student's end- 
ing average. How can the effect of zero scores be minimized? 



O'Connor (1999) states that teachers consider giving the highest pos- 
sibie "F" score on a missing assignment, rather than recording a zero. 
For exampie, if the grading scaie uses "0-69" as an "F," the teacher 
wouid record "69" for aii missing work. Converseiy, Guskey (2004) 
suggests that teachers consider giving "incompiete" grades untii stu- 
dents turn in missing assignments. Students with incompiete grades 
couid then be required to attend speciai after schooi or Saturday 
sessions. Another of Guskey's (2004, p. 52) suggestions is to change 
the grading scaie used. He states: 

Schoois using this approach shift from percentage grading 
scaies where, for exampie, A = 90-100%, B = 80-89%, C = 

70-79% and so on to whoie number scaies where A = 4, B = 

3, C = 2 and so on. in other words, aithough teachers can 
stiii assign zeros to student work that is missed, negiected, or 
turned in iate, the effect of a zero is iessened because it is 
not so extreme. 

Using Guskey's change in grading scaie suggestion, iet's return to 
Jason's grades: 

90 = A = 4 
92 = A = 4 

88 = B = 3 

90 = A = 4 

91 =A = 4 

89 = B = 3 
91 =A = 4 
0 = F =0 

On this scaie, Jason's average wouid be 3.25, corresponding to a B 
average. This B average more cioseiy reflects his overaii work in sci- 
ence this term than the 78.9 (0) he wouid receive on the percentage 
grading scaie, using the mean (inciuding the zero) to caicuiate the 
finai grade. 
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We can also use O'Connor's suggestion and replace Jason's 
zero with 69. Doing this gives Jason an ending average (using 
grades of 90, 92, 88, 90, 91, 89, 91, and 69) of 87.5 (B). Again, 
this grade appears more consistent with his performance in 
science than the final grade of 78.9. We should note, however, 
that this suggestion by O'Connor is hugely controversial. Some dis- 
tricts who have implemented this suggestion have found grades 
challenged by school boards and by parents. 



Let's return to Jason's grades for a moment and re-examine the 
use of different statistical methods to calculate a final grade. 
Jason has one zero. His other grades are: 90, 92, 88, 90, 91, 89, 
and 91. We've already established that using the mean (averag- 
ing) will result in a final grade of 78.9. What if we use the median? 
Placing Jason's grades in numerical order gives us: 0, 88, 89, 90, 
90, 91, 91, and 92. The median is 90. 



In summary, then, teachers can minimize the impact of zeros using 
several different strategies. They can: 1) use the highest possible "F 
score in place of the zero; 2) give "incomplete" grades for missing 





work; 3) use a whole number scale rather than a percentage scale; 
or 4) use the median to figure the final grade, rather than the mean. 
Teachers should use their professional judgment to decide which 
method will work the best In their classroom and which will best align 
with school, district, or state policy. 

The above discussion of grade reliability has brought to light some prob- 
lematic areas In grading. We have seen some very familiar and often 
used grading practices that may negatively Impact grade reliability. 
Rather than dwelling on problems, however, this publication attempts to 
propose solutions. The following summary from Butler & McMunn (2006) 
outlines a productive grading plan that will result In reliable grades. 



FIGURE A.2 

Creating a Productive Grading Plan 



IDEA 


EXAMPLE/EXPLANATION 


1. Grades are strictly tied to 
student achievement of 
curricular standard. 


Grades contain only information about student academic 
performance— no data on conduct, attendance, or attitude 
are included. 


2. Grades are meaningful 
and derived from quality 
assessment tasks. 


Valid assessments aligned with the standards are used to 
collect data. Example: A performance assessment is used to 
determine if a student can correctly operate a microscope 
(not a pencil-and-paper test). Sufficient data (multiple 
sources, triangulation) exists. 


3. Grades are figured from 
summative assessments. 


Assessment data is collected while students are learning new 
concepts, but such formative data are not used in figuring 
final grades. Grades on assignments are only taken (and 
counted) after students have had time to practice the new 
knowledge or skill and have received teedback on their 
pertormances. 


4. Zeros for missing work do 
not overly penalize students. 


Either O’Connor’s or Guskey’s suggestions for minimizing zeros 
are used. 


5. Grading policies are 

developed and shared with 
students and parents. 


Written grading policy statements are distributed to parents 
and students at the beginning of the year. Both parents and 
students are given opportunities to ask questions about the 
policies in order to clarify their understanding. 


6. Grading policies are 
standardized throughout a 
school/district. 


Schools and/or districts have regularly scheduled meetings 
to discuss grading practices. Teachers who teach the same 
subjects at the same grade levels have opportunities to work 
together and discuss scoring ot sample student work. 


7. Grades reflect students' 
current achievement levels. 


Use the median as the statistical method by which to 
calculate final grades or weight the most recent work more 
than earlier work ond/or only use summative assessments for 
figuring final grades. 



Grade Book Formats 

In order to Implement such a productive grading plan as the one 
outlined In FIGURE A.2, changes In record-keeping may be necessary. 
FIGURE A.3 provides one sample grade book format that may be use- 
ful In visualizing new ways of record-keeping. Note that this grade 
book Is divided Into two sides: A Formative Page and a Summative 
Page. We'll examine each page In detail. 




FIGURE A.3 

Sample Grade Book Format for Six-Week Grading 
Period 

Topic Covered: Weather 
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Formative Page 

On the Formative side of the grade book, we find severai different 
entries. The first is diagnostic, rather than formative and consists of 
a journai entry on weather concepts. The teacher has graded this 
with a /)-, / /- grading scheme. The teacher is not trying to record a 
grade for the diagnostic, but rather record a grade that wiii heip her 
remember the reiative knowiedge ieveis shown by the students. We 
assume that students making a /+ wouid have shown greater knowi- 
edge or understanding of weather concepts on this diagnostic than 
those making We hope that the teacher anaiyzed the answers 
from aii the students to heip her pian instruction on this unit. 

The next few entries on the Formative side of the grade book appear 
to be performance items, in which students were required to take 
accurate measurements with various weather instruments. For these 
measurements, the teacher has recorded "A," "NC," or "i" grades. 

"A" stands for Accurate measurement, "NC" is for inconsistent accu- 
racy, and "i" is for inaccurate measuring. Fiopefuiiy, students scoring 
NC or i were given further opportunities to practice and meaningfui 
feedback to heip them improve. 

The next two grades on the Formative Side (Ciassifying Ciouds and 
Weather Map-Symboi interpretation) are simpiy recorded as "number 
right/totai number." A simiiar grading scheme was used on the iast 
assignment recorded, the Terms Worksheet. Again, this information 
is for the purpose of determining who may need further practice in 
these areas. Note that two different students are missing assignments 
in these categories. 

The teacher appears to have used a rubric for the "Predicting and 
Forecasting" and "Concept Map" grades. These rubrics divided 
students into proficiency ieveis of Exceiient (4), Good (3), Adequate 
(2), and Re-do (1). We note that no students were required to re-do 
the assignments. The rubrics were anaiyticai, with ciear descriptions 
of each ievei of performance. Therefore, when students received 
the rubric scores, they couid cieariy identify their own strengths and 
weaknesses. The written descriptors couid aiso aid the students in 
improving their performances. 

We shouid note that none of the assignments iisted on the Formative 
side of the grade book are used to caicuiate the finai grade for the 
six-weeks grading period. The Formative side of the grade book is 
strictiy informationai in nature, it heips the teacher capture snapshots 
of student achievement during the iearning process. Fie can use such 
snapshots to differentiate instruction for his ciass. 

Summative Page 

The summative side of the grade book contains the grades that 
"counted." in other words, these are the grades that were used to 
determine the finai six-weeks grade. On this side, we find five sum- 
mative grades, each showing the weight of the assignment. There 
is a iab practicai exam that counts 30% of the finai grade, two quiz- 
zes (Terms and Ciouds) that each count 10% of the finai grade, a 
Forecasting Essay counting 30% of the finai grade, and a Muitipie- 
Choice Test counting 20% of the finai grade. 



Scores for all assignments except the Forecasting Essay are recorded 
as percentage grades. A rubric was used to grade the Forecasting 
Essay, and the grade conversions are given where the number of 
points earned were translated Into letter grades. 

All grades were then converted to a four-point soale, with A = 4, B = 

3, C = 2, D = 1, and F = 0. Finally, weighted scores were calculated 
Into the final weighted score, using the formula Indicated, and a final 
grade was assigned. 

Flow does this sample grade book format support the productive 
grading plan outlined In the previous section? Let's look at this point 
by point: 

1 . Grades are strictly tied to student achievement of 
curricular standard. 

The curricular standards being taught by this teacher come 
from the National Solenoe Education Standards for K-4. The 
first Is under the subtitle of Earth and Spaoe Solenoe (National 
Researoh Council, 1999, p. 134). One of these oontent stan- 
dards reads, "Weather changes from day to day and over 
the seasons. Weather can be described by measurable 
quantities, such as temperature, wind direction and speed, 
and preolpitatlon." Two other oontent standards from NSES 
are also being utilized by this teacher. These fall under the 
Science as Inquiry category and In the Abilities Neoessary to 
Do Solentific Inquiry suboategory (National Research Council, 
1999, p. 122). These are stated as: "Employ simple equipment 
and tools to gather data and extend the senses," and "Use 
data to oonstruot a reasonable explanation." 

The assignments given to the students In this six-week period 
align with one of these national standards. 

2. Grades are meaningfui and derived from quaiity 
assessment tasks. 

Without viewing the aotual sample of assessments used, this 
Is harder to discern. Flowever, we are able to tell that the 
teaoher provides opportunities for practice before taking 
summative grades and that the formative assessments do 
align with the summative ones. The teaoher utilizes rubrics to 
provide meaningful feedback to students. 

3. Grades are figured from summative assessments. 

No formative grades have been used to figure final grades. 

All grades that "count" are summative ones. 

4. Zeros for missing work do not overiy penaiize students. 

No zeros were reoorded In the summative seotlon of this grade 
book. Flowever, even If there were recorded zeros, the effect of 
the zero grades would be minimized as the teacher converts 
all grades to a four-point soale before figuring final grades. 

5. Grading poiicies are shared with students and parents. 

We have no data from this grade book to tell us If the teacher 
shared grading policies with students and parents. 



6. Grading policies are standardized throughout a 
school/district. 

We have no data that either supports or refutes this action by 
the teacher. 

7. Grades reflect students’ current achievement levels. 

The teacher uses the median as the statisticai method for 
figuring finai grades and oniy uses summative assignments to 
compute finai grades. 

By examining each productive grading practice, this anaiysis of 
the sampie grade book finds that five of the seven recommended 
grading practices ore exempiified by this teacher. 



APPLICATION 



Examine your own grade book. Does it exemplify a productive 
grading plan? If not, what changes could you make? 
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out the region. This staff is highly skilled in providing needs assess- 
ment services, conducting applied research in schools, and devel- 
oping processes, products, and programs that support educational 
improvement and increase student achievement. In the lost three 
years, in addition to its basic research and development work with 
over 170 southeastern schools, SERVE staff provided technical assis- 
tance and training to more than 18,000 teachers and administrators 
across the region. 

The SERVE Center is governed by o board of directors that includes 
the governors, chief state school officers, educators, legislators, and 
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o nationwide education knowledge network, building o bonk of 
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and regionally to improve student achievement. SERVE's Notional 
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improving student outcomes through the use of exemplary pre-K and 
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